ARTIFICIAL ANTIBODY POLYPEPTIDES 

5 This application is a divisional of U.S. Serial No. 09/096,749, filed June 12, 

1998, which claims priority under 35 U.S.C. 1 19(e) to provisional application U.S. 
Serial No. 60/049,410, filed June 12, 1997. These applications are incorporated 
herein by reference. 

1 0 FIELD OF THE INVENTION 

The present invention relates generally to the field of the production and 
selection of binding and catalytic polypeptides by the methods of molecular biology, 
using both combinatorial chemistry and recombinant DNA. The invention 
specifically relates to the generation of both nucleic acid and polypeptide libraries 
15 derived therefrom encoding the molecular scaffolding of Fibronectin Type III (Fn3) 
modified in one or more of its loop regions. The invention also relates to the 
"artificial mini-antibodies" or "monobodies," i.e., the polypeptides comprising an 
Fn3 scaffold onto which loop regions capable of binding to a variety of different 
molecular structures (such as antibody binding sites) have been grafted. 

20 

BACKGROUND OF THE INVENTION 

Antibody structure 

A standard antibody (Ab) is a tetrameric structure consisting of two identical 
immunoglobulin (Ig) heavy chains and two identical light chains. The heavy and 

25 light chains of an Ab consist of different domains. Each light chain has one variable 
domain (VL) and one constant domain (CL), while each heavy chain has one 
variable domain (VH) and three or four constant domains (CH) (Alzari et al M 1988). 
Each domain, consisting of ~ 110 amino acid residues, is folded into a characteristic 
P-sandwich structure formed from two p-sheets packed against each other, the 

30 immunoglobulin fold. The VH and VL domains each have three complementarity 
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determining regions (CDR1-3) that are loops, or turns, connecting p~strands at one 
end of the domains (Fig. 1 : A, C). The variable regions of both the light and heavy 
chains generally contribute to antigen specificity, although the contribution of the 
individual chains to specificity is not always equal. Antibody molecules have 
evolved to bind to a large number of molecules by using six randomized loops 
(CDRs). However, the size of the antibodies and the complexity of six loops 
represents a major design hurdle if the end result is to be a relatively small peptide 
ligand. 

Antibody substructures 

Functional substructures of Abs can be prepared by proteolysis and by 
recombinant methods. They include the Fab fragment, which comprises the VH- 
CH1 domains of the heavy chain and the VL-CL1 domains of the light chain joined 
by a single interchain disulfide bond, and the Fv fragment, which comprises only the 
VH and VL domains. In some cases, a single VH domain retains significant affinity 
(Ward et al., 1989). It has also been shown that a certain monomelic k light chain 
will specifically bind to its cognate antigen. (L. Masat et al., 1994). Separated light 
or heavy chains have sometimes been found to retain some antigen-binding activity 
(Ward et al., 1989). These antibody fragments are not suitable for structural 
analysis using NMR spectroscopy due to their size, low solubility or low 
conformational stability. 

Another functional substructure is a single chain Fv (scFv), comprised of the 
variable regions of the immunoglobulin heavy and light chain, covalently connected 
by a peptide linker (S-z Hu et al., 1996). These small (M t 25,000) proteins generally 
retain specificity and affinity for antigen in a single polypeptide and can provide a 
convenient building block for larger, antigen-specific molecules. Several groups 
have reported biodistribution studies in xenografted athymic mice using scFv 
reactive against a variety of tumor antigens, in which specific tumor localization has 
been observed. However, the short persistence of scFvs in the circulation limits the 
exposure of tumor cells to the scFvs, placing limits on the level of uptake. As a 
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result, tumor uptake by scFvs in animal studies has generally been only 1-5% ID/g 
as opposed to intact antibodies that can localize in tumors ad 30-40% ID/g and have 
reached levels as high as 60-70% ID/g. 

A small protein scaffold called a "minibody" was designed using a part of 
the Ig VH domain as the template (Pessi et al., 1993). Minibodies with high affinity 
(dissociation constant (Kj) ~10" 7 M) to interleukin-6 were identified by randomizing 
loops corresponding to CDR1 and CDR2 of VH and then selecting mutants using 
the phage display method (Martin et al., 1994). These experiments demonstrated 
that the essence of the Ab function could be transferred to a smaller system. 
However, the minibody had inherited the limited solubility of the VH domain 
(Bianchi et al., 1994). 

It has been reported that camels (Camelus dromedarius) often lack variable 
light chain domains when IgG-like material from their serum is analyzed, suggesting 
that sufficient antibody specificity and affinity can be derived form VH domains 
(three CDR loops) alone. Davies and Riechmann recently demonstrated that 
"camelized" VH domains with high affinity (K d ~ 10" 7 M) and high specificity can 
be generated by randomizing only the CDR3. To improve the solubility and 
suppress nonspecific binding, three mutations were introduced to the framework 
region (Davies & Riechmann, 1995). It has not been definitively shown, however, 
that camelization can be used, in general, to improve the solubility and stability of 



An alternative to the "minibody" is the "diabody." Diabodies are small 
bivalent and bispecific antibody fragments, i.e., they have two antigen-binding sites. 
The fragments comprise a heavy-chain variable domain (V H ) connected to a light- 
chain variable domain (V L ) on the same polypeptide chain (V H -V L ). Diabodies are 
similar in size to an Fab fragment. By using a linker that is too short to allow 
pairing between the two domains on the same chain, the domains are forced to pair 
with the complementary domains of another chain and create two antigen-binding 
sites. These dimeric antibody fragments, or "diabodies," are bivalent and bispecific. 
P. Holliger et al, PNAS 90:6444-6448 (1993). 
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Since the development of the monoclonal antibody technology, a large 
number of 3D structures of Ab fragments in the complexed and/or free states have 
been solved by X-ray crystallography (Webster et al., 1994; Wilson & Stanfield, 
1994). Analysis of Ab structures has revealed that five out of the six CDRs have 
5 limited numbers of peptide backbone conformations, thereby permitting one to 
predict the backbone conformation of CDRs using the so-called canonical structures 
(Lesk & Tramontano, 1992; Rees et al., 1994). The analysis also has revealed that 
the CDR3 of the VH domain (VH-CDR3) usually has the largest contact surface and 
that its conformation is too diverse for canonical structures to be defined; VH-CDR3 

10 is also known to have a large variation in length (Wu et al, 1993). Therefore, the 
structures of crucial regions of the Ab-antigen interface still need to be 
experimentally determined. 

Comparison of crystal structures between the free and complexed states has 
revealed several types of conformational rearrangements. They include side-chain 

15 rearrangements, segmental movements, large rearrangements of VH-CDR3 and 
changes in the relative position of the VH and VL domains (Wilson & Stanfield, 
1993). In the free state, CDRs, in particular those which undergo large 
conformational changes upon binding, are expected to be flexible. Since X-ray 
crystallography is not suited for characterizing flexible parts of molecules, structural 

20 studies in the solution state have not been possible to provide dynamic pictures of 
the conformation of antigen-binding sites. 

Mimicking the antibody-binding site 

CDR peptides and organic CDR mimetics have been made (Dougall et al., 
25 1994). CDR peptides are short, typically cyclic, peptides which correspond to the 
amino acid sequences of CDR loops of antibodies. CDR loops are responsible for 
antibody-antigen interactions. Organic CDR mimetics are peptides corresponding to 
CDR loops which are attached to a scaffold, e.g., a small organic compound. 

CDR peptides and organic CDR mimetics have been shown to retain some 
30 binding affinity (Smyth & von Itzstein, 1994). However, as expected, they are too 
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small and too flexible to maintain full affinity and specificity. Mouse CDRs have 
been grafted onto the human Ig framework without the loss of affinity (Jones et al., 
1986; Riechmann et al., 1988), though this "humanization" does not solve the 
above-mentioned problems specific to solution studies. 



Mimicking natural selection processes of Abs 

In the immune system, specific Abs are selected and amplified from a large 
library (affinity maturation). The processes can be reproduced in vitro using 
combinatorial library technologies. The successful display of Ab fragments on the 

10 surface of bacteriophage has made it possible to generate and screen a vast number 
of CDR mutations (McCafferty et al., 1990; Barbas et al., 1991; Winter et al., 1994). 
An increasing number of Fabs and Fvs (and their derivatives) is produced by this 
technique, providing a rich source for structural studies. The combinatorial 
technique can be combined with Ab mimics. 

15 A number of protein domains that could potentially serve as protein 

scaffolds have been expressed as fusions with phage capsid proteins. Review in 
Clackson & Wells, Trends Biotechnol. 12:173-184 (1994). Indeed, several of these 
protein domains have already been used as scaffolds for displaying random peptide 
sequences, including bovine pancreatic trypsin inhibitor (Roberts et al., PNAS 

20 89:2429-2433 (1992)), human growth hormone (Lowman et al., Biochemistry 
30:10832-10838 (1991)), Venturini et al., Protein Peptide Letters 1:70-75 (1994)), 
and the IgG binding domain of Streptococcus (O'Neil et al., Techniques in Protein 
Chemistry V (Crabb, L,. ed.) pp. 517-524, Academic Press, San Diego (1994)). 
These scaffolds have displayed a single randomized loop or region. 

25 Researchers have used the small 74 amino acid a-amylase inhibitor 

Tendamistat as a presentation scaffold on the filamentous phage Ml 3 (McConnell 
and Hoess, 1995). Tendamistat is a P-sheet protein from Streptomyces tendae. It 
has a number of features that make it an attractive scaffold for peptides, including its 
small size, stability, and the availability of high resolution NMR and X-ray 

30 structural data. Tendamistat's overall topology is similar to that of an 
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immunoglobulin domain, with two P-sheets connected by a series of loops. In 
contrast to immunoglobulin domains, the p-sheets of Tendamistat are held together 
with two rather than one disulfide bond, accounting for the considerable stability of 
the protein. By analogy with the CDR loops found in immunoglobulins, the loops 
5 the Tendamistat may serve a similar function and can be easily randomized by in 
vitro mutagenesis. 

Tendamistat, however, is derived from Streptomyces tendae. Thus, while 
Tendamistat may be antigenic in humans, its small size may reduce or inhibit its 
antigenicity. Also, Tendamistat's stability is uncertain. Further, the stability that is 

10 reported for Tendamistat is attributed to the presence of two disulfide bonds. 

Disulfide bonds, however, are a significant disadvantage to such molecules in that 
they can be broken under reducing conditions and must be properly formed in order 
to have a useful protein structure. Further, the size of the loops in Tendamistat are 
relatively small, thus limiting the size of the inserts that can be accommodated in the 

15 scaffold. Moreover, it is well known that forming correct disulfide bonds in newly 
synthesized peptides is not straightforward. When a protein is expressed in the 
cytoplasmic space of E. coli, the most common host bacterium for protein 
overexpression, disulfide bonds are usually not formed, potentially making it 
difficult to prepare large quantities of engineered molecules. 

20 Thus, there is an on-going need for small, single-chain artificial antibodies 

for a variety of therapeutic, diagnostic and catalytic applications. 



SUMMARY OF THE INVENTION 

The invention provides a fibronectin type III (Fn3) polypeptide monobody 
25 comprising a plurality of Fn3 P-strand domain sequences that are linked to a 
plurality of loop region sequences. One or more of the monobody loop region 
sequences of the Fn3 polypeptide vary by deletion, insertion or replacement of at 
least two amino acids from the corresponding loop region sequences in wild-type 
Fn3. The P-strand domains of the monobody have at least about 50% total amino 
30 acid sequence homology to the corresponding amino acid sequence of wild-type 
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Fn3's p-strand domain sequences. Preferably, one or more of the loop regions of the 
monobody comprise amino acid residues: 

i) from 15 to 16 inclusive in an AB loop; 

ii) from 22 to 30 inclusive in a BC loop; 
5 iii) from 39 to 45 inclusive in a CD loop; 

iv) from 51 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 

The invention also provides a nucleic acid molecule encoding a Fn3 

10 polypeptide monobody of the invention, as well as an expression vector comprising 
said nucleic acid molecule and a host cell comprising said vector. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody. The method comprises providing a DNA sequence encoding a plurality 
of Fn3 P-strand domain sequences that are linked to a plurality of loop region 

15 sequences, wherein at least one loop region of said sequence contains a unique 

restriction enzyme site. The DNA sequence is cleaved at the unique restriction site. 
Then a preselected DNA segment is inserted into the restriction site. The 
preselected DNA segment encodes a peptide capable of binding to a specific binding 
partner (SBP) or a transition state analog compound (TSAC). The insertion of the 

20 preselected DNA segment into the DNA sequence yields a DNA molecule which 
encodes a polypeptide monobody having an insertion. The DNA molecule is then 
expressed so as to yield the polypeptide monobody. 

Also provided is a method of preparing a Fn3 polypeptide monobody, which 
method comprises providing a replicatable DNA sequence encoding a plurality of 

25 Fn3 P-strand domain sequences that are linked to a plurality of loop region 

sequences, wherein the nucleotide sequence of at least one loop region is known. 
Polymerase chain reaction (PCR) primers are provided or prepared which are 
sufficiently complementary to the known loop sequence so as to be hybridizable 
under PCR conditions, wherein at least one of the primers contains a modified 

30 nucleic acid sequence to be inserted into the DNA sequence. PCR is performed 



7 




using the replicatable DNA sequence and the primers. The reaction product of the 
PCR is then expressed so as to yield a polypeptide monobody. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody. The method comprises providing a replicatable DNA sequence 
5 encoding a plurality of Fn3 P-strand domain sequences that are linked to a plurality 
of loop region sequences, wherein the nucleotide sequence of at least one loop 
region is known. Site-directed mutagenesis of at least one loop region is performed 
so as to create an insertion mutation. The resultant DNA comprising the insertion 
mutation is then expressed. 

10 Further provided is a variegated nucleic acid library encoding Fn3 

polypeptide monobodies comprising a plurality of nucleic acid species encoding a 
plurality of Fn3 P-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein one or more of the monobody loop region sequences vary 
by deletion, insertion or replacement of at least two amino acids from corresponding 

15 loop region sequences in wild- type Fn3, and wherein the p-strand domains of the 
monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequence of P-strand domain sequences of the wild-type 
Fn3. The invention also provides a peptide display library derived from the 
variegated nucleic acid library of the invention. Preferably, the peptide of the 

20 peptide display library is displayed on the surface of a bacteriophage, e.g., a Ml 3 
bacteriophage or a fd bacteriophage, or virus. 

The invention also provides a method of identifying the amino acid sequence 
of a polypeptide molecule capable of binding to a specific binding partner (SBP) so 
as to form a polypeptide:SSP complex, wherein the dissociation constant of the said 

25 polypeptide:SBP complex is less than 10" 6 moles/liter. The method comprises the 
steps of: 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized or 
separable SBP; 

30 c) separating the peptide:SBP complexes from the free peptides; 
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d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 
peptides capable of binding the SBP; 

e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 
peptide sequence capable of binding to the SBP. 

The present invention also provides a method of preparing a variegated 
nucleic acid library encoding Fn3 polypeptide monobodies having a plurality of 
nucleic acid species each comprising a plurality of loop regions, wherein the species 
encode a plurality of Fn3 P-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the P-strand domain sequences 
of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of P-strand domain sequences of the wild-type 
Fn3, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence; 

b) contacting the polypeptide with a specific binding partner (SBP) so 



constant of the said polypeptide: SBP complex is less than 10" 
moles/liter; 

c) determining the binding structure of the polypeptide: SBP complex by 
nuclear magnetic resonance spectroscopy or X-ray crystallography; 
and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 



as to form a polypeptide:SSP complex wherein the dissociation 
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the information provided in (c), result in one or more polypeptides 
with improved binding to the SBP. 
Also provided is a method of identifying the amino acid sequence of a 
polypeptide molecule capable of catalyzing a chemical reaction with a catalyzed rate 
5 constant, k cat , and an uncatalyzed rate constant, k uncat , such that the ratio of k cat /k uncat 
is greater than 10. The method comprises the steps of: 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized or 
separable transition state analog compound (TSAC) representing the 

10 approximate molecular transition state of the chemical reaction; 

c) separating the peptide:TSAC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 

1 5 peptides capable of binding the TSAC; 

e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 

20 peptide sequence. 

The invention also provides a method of preparing a variegated nucleic acid 
library encoding Fn3 polypeptide monobodies having a plurality of nucleic acid 
species each comprising a plurality of loop regions, wherein the species encode a 
plurality of Fn3 P-strand domain sequences that are linked to a plurality of loop 

25 region sequences, wherein one or more of the loop region sequences vary by 

deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the P-strand domain sequences 
of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of P-strand domain sequences of the wild-type 

30 Fn3, comprising the steps of 
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a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k cat , and an 
uncatalyzed rate constant, k^, such that the ratio of k^/k^ is 
greater than 10; 

b) contacting the polypeptide with an immobilized or separable 
transition state analog compound (TSAC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptide:TSAC complex 
by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to or stabilization of the TSAC. 

The invention also provides a kit for the performance of any of the methods 
of the invention. The invention further provides a composition, e.g., a polypeptide, 
prepared by the use of the kit, or identified by any of the methods of the invention. 

The following abbreviations have been used in describing amino acids, 
peptides, or proteins: Ala, or A, Alanine; Arg, or R, Arginine; Asn or N, asparagine; 
Asp, or D, aspartic acid; Cysor C, cystein; Gin, or Q, glutamine; Glu, or E, glutamic 
acid; Gly, or G, glycine; His, or H, histidine; lie, or I, isoleucine; Leu, or L, leucine; 
Lys, or K, lysine; Met, or M, methionine; Phe, or F, phenylalanine; Pro, or P, 
proline; Ser, or S, serine; Thr, or T, threonine; Trp, or W, tryptophan; Tyr, or Y, 
tyrosine; Val, or V, valine. 

The following abbreviations have been used in describing nucleic acids, 
DNA, or RNA: A, adenosine; T, thymidine; G, guanosine; C, cytosine. 



BRIEF DESCRIPTION OF THE DRAWINGS 



11 





Figure 1 A. p-Strand and loop topology of anti-lysozyme immunoglobulin 
D1.3. (Bhat et al., 1994). The locations of complementarity determining regions 
(CDRs, hypervariable regions) are indicated. 

Figure IB. p-Strand and loop topology of the 10th type III domain of human 
5 fibronectin. (Main et al., 1992) The locations of the integrin-binding Arg-Gly- Asp 
(RGD) sequence is indicated. 

Figure 1C. MOLSCRIPT representation of anti-lysozyme immunoglobulin 
D1.3. (Fraulis, 1991; Bhat et al., 1994) The locations of complementarity 
determining regions (CDRs, hypervariable regions) are indicated. 
10 Figure ID. MOLSCRIPT representation of the 10th type III domain of 

human fibronectin. (Kraulis, 1991; Main et al., 1992) The locations of the integrin- 
binding Arg-Gly- Asp (RGD) sequence is indicated. 

Figure 2. Amino acid sequence (SEQ ID NO: 1 10) and restriction sites of the 
synthetic Fn3 gene. The residue numbering is according to Main et al. (1992). 
15 Restriction enzyme sites designed are shown above the amino acid sequence. 

P-Strands are denoted by underlines. The N-terminal "mq" sequence has been added 
for a subsequent cloning into an expression vector. The His tag (Novagen) fusion 
protein has an additional sequence, MGSSHHHHHHSSGLVPRGSH (SEQ ID 
NO:l 14), preceding the Fn3 sequence shown above. 
20 Figure 3 A. Far UV CD spectra of wild-type Fn3 at 25°C and 90°C. Fn3 (50 

\iM) was dissolved in sodium acetate (50 mM, pH 4.6). 

Figure 3B. Thermal denaturation of Fn3 monitored at 215 nm. Temperature 
was increased at a rate of l°C/min. 

Figure 4A. Ca trace of the crystal structure of the complex of lysozyme 
25 (HEL) and the Fv fragment of the anti-hen egg-white lysozyme (anti-HEL) antibody 
D1.3 (Bhat et al, 1994). Side chains of the residues 99-102 of VH CDR3, which 
make contact with HEL, are also shown. 

Figure 4B. Contact surface area for each residue of the Dl .3 VH-HEL and 
VH-VL interactions plotted vs. residue number of D1.3 VH. Surface area and 
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secondary structure were determined using the program DSSP (Kabsh and Sander, 
1983). 

Figure 4C. Schematic drawings of the P-sheet structure of the F strand-loop- 
G strand moieties of D 1.3 VH. The boxes denote residues in P-strands and ovals 
5 those not in strands. The shaded boxes indicate residues of which side chains are 
significantly buried. The broken lines indicate hydrogen bonds. 

Figure 4D. Schematic drawings of the P-sheet structure of the F strand-loop- 
G strand moieties of Fn3. The boxes denote residues in P-strands and ovals those 
not in strands. The shaded boxes indicate residues of which side chains are 
10 significantly buried. The broken lines indicate hydrogen bonds. 

Figure 5. Designed Fn3 gene showing DNA and amino acid sequences 
(SEQ ID NO: 1 1 1 and SEQ ID NO: 1 12) . The amino acid numbering is according to 
Main et al. (1992). The two loops that were randomized in combinatorial libraries 
are enclosed in boxes. 

15 Figure 6. Map of plasmid pAS45. Plasmid pAS45 is the expression vector 

of Histag-Fn3. 

Figure 7. Map of plasmid pAS25. Plasmid pAS25 is the expression vector 

ofFn3. 

Figure 8. Map of plasmid pAS38. pAS38 is a phagmid vector for the 

20 surface display of Fn3. 

Figure 9. (Ubiquitin-1) Characterization of ligand-specific binding of 
enriched clones using phage enzyme-linked immunosolvent assay (ELISA). 
Microtiter plate wells were coated with ubiquitin (1 |ig/well; "Ligand (+)) and then 
blocked with BSA. Phage solution in TBS containing approximately 10 10 colony 

25 forming units (cfu) was added to a well and washed with TBS. Bound phages were 
detected with anti-phage antibody-POD conjugate (Pharmacia) with Turbo-TMB 
(Pierce) as a substrate. Absorbance was measured using a Molecular Devices 
SPECTRAMAX 250 microplate spectrophotometer. For a control, wells without 
the immobilized ligand were used. 2-1 and 2-2 denote enriched clones from Library 



13 



• • 

2 eluted with free ligand and acid, respectively. 4-1 and 4-2 denote enriched clones 
from Library 4 eluted with free ligand and acid, respectively. 

Figure 10. (Ubiquitin-2) Competition phage ELISA of enriched clones. 
Phage solutions containing approximately 10 10 cfu were first incubated with free 
5 ubiquitin at 4°C for 1 hour prior to the binding to a ligand-coated well. The wells 
were washed and phages detected as described above. 

Figure 1 1 . Competition phage ELISA of ubiquitin-binding monobody 411. 
Experimental conditions are the same as described above for ubiquitin. The ELISA 
was performed in the presence of free ubiquitin in the binding solution. The 
10 experiments were performed with four different preparations of the same clone. 
Figure 12. (Fluorescein-1) Phage ELISA of four clones, pLB25.1 
(containing SEQ ID NO:l 15), pLB25.4 (containing SEQ ID NO:l 16), pLB24.1 
(containing SEQ ID NO: 117) and pLB24.3 (containing SEQ ID NO:l 18). 
Experimental conditions are the same as ubiquitin- 1 above. 
15 Figure 13. (Fluorescein-2) Competition ELISA of the four clones. 

Experimental conditions are the same as ubiquitin-2 above. 

Figure 14. ! H, 15 N-HSQC spectrum of a fluorescence-binding monobody 
LB25.5. Approximately 20 \xm protein was dissolved in 10 mm sodium acetate 
buffer (pH 5.0) containing 100 mM sodium chloride. The spectrum was collected at 
20 30°C on a Varian Unity INOVA 600 NMR spectrometer. 

Figure 15 A. Characterization of the binding reaction of Ubi4-Fn3 to the 
target, ubiquitin. Phage ELISA analysis of binding of Ubi4-Fn3 to ubiquitin. The 
binding of Ubi4-phages to ubiquitin-coated wells was measured. The control 
experiment was performed with wells containing no ubiquitin. 
25 Figure 15B. Competition phage ELISA of Ubi4-Fn3. Ubi4-Fn3 phages 

were preincubated with soluble ubiquitin at an indicated concentration, followed by 
the phage ELISA detection in ubiquitin-coated wells. 

Figure 15C. Competition phage ELISA testing the specificity of the Ubi4 
clone. The Ubi4 phages were preincubated with 250 jig/ml of soluble proteins, 
30 followed by phage ELISA as in (b). 
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Figure 15D. ELISA using free proteins. 

Figure 16. Equilibrium unfolding curves for Ubi4-Fn3 (closed symbols) and 
wild-type Fn3 (open symbols). Squares indicate data measured in TBS (Tris HCI 
buffer (50 mM, pH 7.5) containing NaCl (150 mM)). Circles indicate data 
5 measured in Gly HCI buffer (20 mM, pH 3.3) containing NaCl (300 mM). The 
curves show the best fit of the transition curve based on the two-state model. 
Parameters characterizing the transitions are listed in Table 7. 

Figure 17. (a) ] H, 15 N-HSQC spectrum of [ 15 N]-Ubi4-K Fn3. (b). 
Difference (6^.^ - 8^) of ! H (b) and 15 N (c) chemical shifts plotted versus 
10 residue number. Values for residues 82-84 (shown as filled circles) where Ubi4-K 
deletions are set to zero. Open circles indicate residues that are mutated in the 
Ubi4-K protein. The locations of p-strands are indicated with arrows. 



DETAILED DESCRIPTION OF THE INVENTION 

15 For the past decade the immune system has been exploited as a rich source 

of de novo catalysts. Catalytic antibodies have been shown to have 
chemoselectivity, enantioselectivity, large rate accelerations, and even an ability to 
reroute chemical reactions. In most cases the antibodies have been elicited to 
transition state analog (TSA) haptens. These TSA haptens are stable, low-molecular 

20 weight compounds designed to mimic the structures of the energetically unstable 
transition state species that briefly (approximate half-life 10' 13 s) appear along 
reaction pathways between reactants and products. Anti-TSA antibodies, like 
natural enzymes, are thought to selectively bind and stabilize transition state, 
thereby easing the passage of reactants to products. Thus, upon binding, the 

25 antibody lowers the energy of the actual transition state dnd increases the rate of the 
reaction. These catalysts can be programmed to bind to geometrical and 
electrostatic features of the transition state so that the reaction route can be 
controlled by neutralizing unfavorable charges, overcoming entropic barriers, and 
dictating stereoelectronic features of the reaction. By this means even reactions that 

30 are otherwise highly disfavored have been catalyzed (Janda et al. 1997). Further, in 

15 




many instances catalysts have been made for reactions for which there are no known 
natural or manmade enzymes. 

The success of any combinatorial chemical system in obtaining a particular 
function depends on the size of the library and the ability to access its members. 
5 Most often the antibodies that are made in an animal against a hapten that mimics 
the transition state of a reaction are first screened for binding to the hapten and then 
screened again for catalytic activity. An improved method allows for the direct 
selection for catalysis from antibody libraries in phage, thereby linking chemistry 
and replication. 

10 A library of antibody fragments can be created on the surface of filamentous 

phage viruses by adding randomized antibody genes to the gene that encodes the 
phage's coat protein. Each phage then expresses and displays multiple copies of a 
single antibody fragment on its surface. Because each phage possesses both the 
surface-displayed antibody fragment and the DNA that encodes that fragment, and 

1 5 antibody fragment that binds to a target can be identified by amplifying the 
associated DNA. 

Immunochemists use as antigens materials that have as little chemical 
reactivity as possible. It is almost always the case that one wishes the ultimate 
antibody to interact with native structures. In reactive immunization the concept is 

20 just the opposite. One immunizes with compounds that are highly reactive so that 
upon binding to the antibody molecule during the induction process, a chemical 
reaction ensues. Later this same chemical reaction becomes part of the mechanism 
of the catalytic event. In a certain sense one is immunizing with a chemical reaction 
rather than a substance per se. Reactive immunogens can be considered as 

25 analogous to the mechanism-based inhibitors that enzymologists use except that 
they are used in the inverse way in that, instead of inhibiting a mechanism, they 
induce a mechanism. 

Man-made catalytic antibodies have considerable commercial potential in 
many different applications. Catalytic antibody-based products have been used 

30 successfully in prototype experiments in therapeutic applications, such as prodrug 
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activation and cocaine inactivation, and in nontherapeutic applications, such as 
biosensors and organic synthesis. 

Catalytic antibodies are theoretically more attractive than noncatalytic 
antibodies as therapeutic agents because, being catalytic, they may be used in lower 
doses, and also because their effects are unusually irreversible (for example, peptide 
bond cleavage rather than binding). In therapy, purified catalytic antibodies could 
be directly administered to a patient, or alternatively the patient's own catalytic 
antibody response could be elicited by immunization with an appropriate hapten. 
Catalytic antibodies also could be used as clinical diagnostic tools or as 
regioselective or stereoselective catalysts in the synthesis of fine chemicals. 

I. Mutation of Fn3 loops and grafting of Ab loops onto Fn3 

An ideal scaffold for CDR grafting is highly soluble and stable. It is small 
enough for structural analysis, yet large enough to accommodate multiple CDRs so 
as to achieve tight binding and/or high specificity. 

A novel strategy to generate an artificial Ab system on the framework of an 
existing non-Ab protein was developed. An advantage of this approach over the 
minimization of an Ab scaffold is that one can avoid inheriting the undesired 
properties of Abs. Fibronectin type III domain (Fn3) was used as the scaffold. 
Fibronectin is a large protein which plays essential roles in the formation of 
extracellular matrix and cell-cell interactions; it consists of many repeats of three 
types (I, II and III) of small domains (Baron et al., 1991). Fn3 itself is the paradigm 
of a large subfamily (Fn3 family or s-type Ig family) of the immunoglobulin 
superfamily (IgSF). The Fn3 family includes cell adhesion molecules, cell surface 
hormone and cytokine receptors, chaperonins, and carbohydrate-binding domains 
(for reviews, see Bork & Doolittle, 1992; Jones, 1993; Bork et al, 1994; Campbell 
& Spitzfaden, 1994; Harpez & Chothia, 1994). 

Recently, crystallographic studies revealed that the structure of the DNA 
binding domains of the transcription factor NF-kB is also closely related to the Fn3 
fold (Ghosh et al., 1995; Miiller et al., 1995). These proteins are all involved in 
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specific molecular recognition, and in most cases ligand-binding sites are formed by 
surface loops, suggesting that the Fn3 scaffold is an excellent framework for 
building specific binding proteins. The 3D structure of Fn3 has been determined by 
NMR (Main et al., 1992) and by X-ray crystallography (Leahy et al., 1992; 
5 Dickinson et al., 1994). The structure is best described as a P-sandwich similar to 
that of Ab VH domain except that Fn3 has seven P-strands instead of nine (Fig. 1). 
There are three loops on each end of Fn3; the positions of the BC, DE and FG loops 
approximately correspond to those of CDRI, 2 and 3 of the VH domain, respectively 
(Fig. 1 C, D). 

10 Fn3 is small (~ 95 residues), monomeric, soluble and stable. It is one of few 

members of IgSF that do not have disulfide bonds; VH has an interstrand disulfide 
bond (Fig. 1 A) and has marginal stability under reducing conditions. Fn3 has been 
expressed in E. coli (Aukhil et al., 1993). In addition, 17 Fn3 domains are present 
just in human fibronectin, providing important information on conserved residues 

15 which are often important for the stability and folding (for sequence alignment, see 
Main et al., 1992 and Dickinson et al., 1994). From sequence analysis, large 
variations are seen in the BC and FG loops, suggesting that the loops are not crucial 
to stability. NMR studies have revealed that the FG loop is highly flexible; the 
flexibility has been implicated for the specific binding of the 10th Fn3 to a 5 Pj 

20 integrin through the Arg-Gly- Asp (RGD) (SEQ ID NO : 1 1 3) motif In the crystal 
structure of human growth hormone-receptor complex (de Vos et aL, 1992), the 
second Fn3 domain of the receptor interacts with hormone via the FG and BC loops, 
suggesting it is feasible to build a binding site using the two loops. 

The tenth type III module of fibronectin has a fold similar to that of 

25 immunoglobulin domains, with seven p strands forming two antiparallel P sheets, 
which pack against each other (Main et al., 1992). The structure of the type II 
module consists of seven P strands, which form a sandwich of two antiparallel P 
sheets, one containing three strands (ABE) and the other four strands (C'CFG) 
(Williams et al., 1988). The triple-stranded p sheet consists of residues Glu-9-Thr- 

30 14 (A), Ser-17-Asp-23 (B), and Thr-56-Ser-60 (E). The majority of the conserved 
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residues contribute to the hydrophobic core, with the invariant hydrophobic residues 
Trp-22 and Try-68 lying toward the N-terminal and C-terminal ends of the core, 
respectively. The p strands are much less flexible and appear to provide a rigid 
framework upon which functional, flexible loops are built The topology is similar 
5 to that of immunoglobulin C domains. 

Gene construction and mutagenesis 

A synthetic gene for tenth Fn3 of human fibronectin (Fig. 2) was designed 
which includes convenient restriction sites for ease of mutagenesis and uses specific 

10 codons for high-level protein expression (Gribskov et al, 1984). 

The gene was assembled as follows: (1) the gene sequence was divided into 
five parts with boundaries at designed restriction sites (Fig. 2); (2) for each part, a 
pair of oligonucleotides that code opposite strands and have complementary 
overlaps of ~ 15 bases was synthesized; (3) the two oligonucleotides were annealed 

15 and single strand regions were filled in using the Klenow fragment of DNA 
polymerase; (4) the double-stranded oligonucleotide was cloned into the pET3a 
vector (Novagen) using restriction enzyme sites at the termini of the fragment and 
its sequence was confirmed by an Applied Biosystems DNA sequencer using the 
dideoxy termination protocol provided by the manufacturer; (5) steps 2-4 were 

20 repeated to obtain the whole gene (plasmid pAS25) (Fig. 7). 

Although the present method takes more time to assemble a gene than the 
one-step polymerase chain reaction (PCR) method (Sandhu et al., 1992), no 
mutations occurred in the gene. Mutations would likely have been introduced by the 
low fidelity replication by Taq polymerase and would have required 

25 time-consuming gene editing. The gene was also cloned into the pET15b 

(Novagen) vector (pEWl). Both vectors expressed the Fn3 gene under the control 
of bacteriophage T7 promoter (Studler et al. 1990); pAS25 expressed the 96-residue 
Fn3 protein only, while pEWl expressed Fn3 as a fusion protein with poly-histidine 
peptide (His»tag). Recombinant DNA manipulations were performed according to 

30 Molecular Cloning (Sambrook et al., 1989), unless otherwise stated. 
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Mutations were introduced to the Fn3 gene using either cassette mutagenesis 
or oligonucleotide site-directed mutagenesis techniques (Deng & Nickoloff, 1992). 
Cassette mutagenesis was performed using the same protocol for gene construction 
described above; double-stranded DNA fragment coding a new sequence was cloned 
5 into an expression vector (pAS25 and/or pEWl). Many mutations can be made by 
combining a newly synthesized strand (coding mutations) and an oligonucleotide 
used for the gene synthesis. The resulting genes were sequenced to confirm that the 
designed mutations and no other mutations were introduced by mutagenesis 
reactions. 

10 

Design and synthesis of Fn3 mutants with antibody CDRs 

Two candidate loops (FG and BC) were identified for grafting. Antibodies 
with known crystal structures were examined in order to identify candidates for the 
sources of loops to be grafted onto Fn3. Anti-hen egg lysozyme (HEL) antibody 

15 D1.3 (Bhat et al., 1994) was chosen as the source of a CDR loop. The reasons for 
this choice were: (1) high resolution crystal structures of the free and complexed 
states are available (Fig. 4 A; Bhat et al., 1994), (2) thermodynamics data for the 
binding reaction are available (Tello et al., 1993), (3) D1.3 has been used as a 
paradigm for Ab structural analysis and Ab engineering (Verhoeyen et al., 1988; 

20 McCafferty et al., 1990) (4) site-directed mutagenesis experiments have shown that 
CDR3 of the heavy chain (VH-CDR3) makes a larger contribution to the affinity 
than the other CDRs (Hawkins et al., 1993), and (5) a binding assay can be easily 
performed. The objective for this trial was to graft VH-CDR3 of Dl .3 onto the Fn3 
scaffold without significant loss of stability. 

25 An analysis of the Dl .3 structure (Fig. 4) revealed that only residues 99-102 

("RDYR") make direct contact with hen egg-white lysozyme (HEL) (Fig. 4 B), 
although VH-CDR3 is defined as longer (Bhat et al., 1994). It should be noted that 
the C-terminal half of VH-CDR3 (residues 101-104) made significant contact with 
the VL domain (Fig. 4 B). It has also become clear that Dl .3 VH-CDR3 (Fig. 4 C) 

30 has a shorter turn between the strands F and G than the FG loop of Fn3 (Fig. 4 D). 
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Therefore, mutant sequences were designed by using the RDYR (99-102) of D 1.3 as 
the core and made different boundaries and loop lengths (Table 1). Shorter loops 
may mimic the D 1.3 CDR3 conformation better, thereby yielding higher affinity, 
but they may also significantly reduce stability by removing wild-type interactions 
ofFn3. 
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Table 1. Amino acid sequences of D 1.3 VH CDR3, VH8 CDR3 and Fn3 FG loop 
and list of planned mutants. 
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Underlines indicate residues in P-strands. Bold characters indicate replaced 
residues. 



In addition, an anti-HEL single VH domain termed VH8 (Ward et al., 1989) 
was chosen as a template. VH8 was selected by library screening and, in spite of the 
lack of the VL domain, VH8 has an affinity for HEL of 27 nM, probably due to its 
longer VH-CDR3 (Table 1). Therefore, its VH-CDR3 was grafted onto Fn3. 
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Longer loops may be advantageous on the Fn3 framework because they may 
provide higher affinity and also are close to the loop length of wild-type Fn3. The 
3D structure of VH8 was not known and thus the VH8 CDR3 sequence was aligned 
with that of D1.3 VH-CDR3; two loops were designed (Table 1). 

Mutant construction and production 

Site-directed mutagenesis experiments were performed to obtain designed 
sequences. Two mutant Fn3s, D 1.3-1 and DL3-4 (Table 1) were obtained and both 
were expressed as soluble His-tag fusion proteins. D1.34 was purified and the 
His.tag portion was removed by thrombin cleavage. Dl .3-4 is soluble up to at least 

1 mM at pH 7.2. No aggregation of the protein has been observed during sample 
preparation and NMR data acquisition. 

Protein expression and purification 

E. coli BL21 (DE3) (Novagen) were transformed with an expression vector 
(pAS25, pEWl and their derivatives) containing a gene for the wild-type or a 
mutant. Cells were grown in M9 minimal medium and M9 medium supplemented 
with Bactotrypton (Difco) containing ampicillin (200 jig/ml). For isotopic labeling, 
15 N NH 4 CI and/or 13 C glucose replaced unlabeled components. 500 mL medium in a 

2 liter baffle flask were inoculated with 10 ml of overnight culture and agitated at 
37°C. Isopropylthio-P-galactoside (IPTG) was added at a final concentration of 1 
mM to initiate protein expression when OD (600 nm) reaches one. The ceils were 
harvested by centrifugation 3 hours after the addition of IPTG and kept frozen at 
-70°C until used. 

Fn3 without His-tag was purified as follows. Cells were suspended in 5 
ml/(g cell) of Tris (50 mM, pH 7.6) containing ethylenediaminetetraacetic acid 
(EDTA; 1 mM) and phenylmethylsulfonyl fluoride (1 mM). HEL was added to a 
final concentration of 0.5 mg/ml. After incubating the solution for 30 minutes at 
37°C, it was sonicated three times for 30 seconds on ice. Cell debris was removed 
by centrifugation. Ammonium sulfate was added to the solution and precipitate 
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recovered by centrifugation. The pellet was dissolved in 5-10 ml sodium acetate (50 
mM, pH 4.6) and insoluble material was removed by centrifugation. The solution 
was applied to a SEPHACRYL S100HR column (Pharmacia) equilibrated in the 
sodium acetate buffer. Fractions containing Fn3 then was applied to a 
5 RESOURCES® column (Pharmacia) equilibrated in sodium acetate (50 mM, pH 
4.6) and eluted with a linear gradient of sodium chloride (0-0.5 M). The protocol 
can be adjusted to purify mutant proteins with different surface charge properties. 

Fn3 with His»tag was purified as follows. The soluble fraction was prepared 
as described above, except that sodium phosphate buffer (50 mM, pH 7.6) 

10 containing sodium chloride (100 mM) replaced the Tris buffer. The solution was 
applied to a HI-TRAP chelating column (Pharmacia) preloaded with nickel and 
equilibrated in the phosphate buffer. After washing the column with the buffer, 
His«tag-Fn3 was eluted in the phosphate buffer containing 50 mM EDTA. Fractions 
containing His«tag-Fn3 were pooled and applied to a SEPHACRYL SI 00-HR 

15 column, yielding highly pure protein. The His»tag portion was cleaved off by 
treating the fusion protein with thrombin using the protocol supplied by Novagen. 
Fn3 was separated from the His»tag peptide and thrombin by a RESOURCES® 
column using the protocol above. 

The wild-type and two mutant proteins so far examined are expressed as 

20 soluble proteins. In the case that a mutant is expressed as inclusion bodies 

(insoluble aggregate), it is first examined if it can be expressed as a soluble protein 
at lower temperature (e.g., 25-30°C). If this is not possible, the inclusion bodies are 
collected by low-speed centrifugation following cell lysis as described above. The 
pellet is washed with buffer, sonicated and centrifuged. The inclusion bodies are 

25 solubilized in phosphate buffer (50 mM, pH 7.6) containing guanidinium chloride 
(GdnCl, 6 M) and will be loaded on a HI-TRAP chelating column. The protein is 
eluted with the buffer containing GdnCl and 50 mM EDTA. 

Conformation of mutant Fn3, Dl.3-4 
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The l H NMR spectra of His»tag D 1.3-4 fusion protein closely resembled that 
of the wild- type, suggesting the mutant is folded in a similar conformation to that of 
the wild- type. The spectrum of Dl .3-4 after the removal of the His»tag peptide 
showed a large spectral dispersion. A large dispersion of amide protons (7-9.5 ppm) 
and a large number of downfield (5.0-6.5 ppm) C a protons are characteristic of a P- 
sheet protein (Wiithrich, 1986). 

The 2D NOESY spectrum of D1.34 provided further evidence for a 
preserved conformation. The region in the spectrum showed interactions between 
upfield methyl protons (< 0.5 ppm) and methyl-methylene protons. The Val72 y 
methyl resonances were well separated in the wild-type spectrum (-0.07 and 0.37 
ppm; (Baron et al., 1992)). Resonances corresponding to the two methyl protons are 
present in the Dl.3-4 spectrum (-0.07 and 0.44 ppm). The cross peak between these 
two resonances and other conserved cross peaks indicate that the two resonances in 
the Dl.3-4 spectrum are highly likely those of Val72 and that other methyl protons 
are in nearly identical environment to that of wild-type Fn3. Minor differences 
between the two spectra are presumably due to small structural perturbation due to 
the mutations. Val72 is on the F strand, where it forms a part of the central 
hydrophobic core of Fn3 (Main et al., 1992). It is only four residues away from the 
mutated residues of the FG loop (Table 1). The results are remarkable because, 
despite there being 7 mutations and 3 deletions in the loop (more than 10% of total 
residues; Fig. 12, Table 2), Dl.3-4 retains a 3D structure virtually identical to that of 
the wild-type (except for the mutated loop). Therefore, the results provide strong 
support that the FG loop is not significantly contributing to the folding and stability 
of the Fn3 molecule and thus that the FG loop can be mutated extensively. 
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Structure and stability measurements 

Structures of Abs were analyzed using quantitative methods (e.g., DSSP 
(Kabsch & Sander, 1983) and PDBFIT (D. McRee, The Scripps Research Institute)) 
5 as well as computer graphics (e.g., QUANTA (Molecular Simulations) and WHAT IF 
(G. Vriend, European Molecular Biology Laboratory)) to superimpose the 
strand- loop-strand structures of Abs and Fn3. 

The stability of FnAbs was determined by measuring temperature- and 
chemical denaturant-induced unfolding reactions (Pace et al., 1989). The 
10 temperature-induced unfolding reaction was measured using a circular dichroism 
(CD) polarimeter. Ellipticity at 222 and 215 nm was recorded as the sample 
temperature was slowly raised. Sample concentrations between 10 and 50 |iM were 
used. After the unfolding baseline was established, the temperature was lowered to 
examine the reversibility of the unfolding reaction. Free energy of unfolding was 
15 determined by fitting data to the equation for the two-state transition (Becktel & 
Schellman, 1987; Pace et al, 1989). Nonlinear least-squares fitting was performed 
using the program IGOR (WaveMetrics) on a Macintosh computer. 

The structure and stability of two selected mutant Fn3s were studied; the first 
mutant was D1.34 (Table 2) and the second was a mutant called AS40 which contains 
20 four mutations in the BC loop (A 26 V 27 T 28 V 29 ) - TQRQ). AS40 was randomly chosen 
from the BC loop library described above. Both mutants were expressed as soluble 
proteins in E. coli and were concentrated at least to 1 mM, permitting NMR studies. 

The mid-point of the thermal denaturation for both mutants was approximately 
69°C, as compared to approximately 79°C for the wild-type protein. The results 
25 indicated that the extensive mutations at the two surface loops did not drastically 
decrease the stability of Fn3, and thus demonstrated the feasibility of introducing a 
large number of mutations in both loops. 

Stability was also determined by guanidinium chloride (GdnCl)- and urea- 
induced unfolding reactions. Preliminary unfolding curves were recorded using a 
30 fluorometer equipped with a motor-driven syringe; GdnCl or urea were added 
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continuously to the protein solution in the cuvette. Based on the preliminary 
unfolding curves, separate samples containing varying concentration of a denaturant 
were prepared and fluorescence (excitation at 290 nm, emission at 300-400 nm) or CD 
(ellipticity at 222 and 215 nm) were measured after the samples were equilibrated at 
5 the measurement temperature for at least one hour. The curve was fitted by the 

least-squares method to the equation for the two-state model (Santoro & Bolen, 1988; 
Koide et al., 1993). The change in protein concentration was compensated if required. 

Once the reversibility of the thermal unfolding reaction is established, the 
unfolding reaction is measured by a Microcal MC-2 differential scanning calorimeter 
10 (DSC). The cell (~ 1.3 ml) will be filled with FnAb solution (0.1-1 mM) and ACp (= 
AH/ AT) will be recorded as the temperature is slowly raised. T m 
(the midpoint of unfolding), AH of unfolding and AG of unfolding is determined by 
fitting the transition curve (Privalov & Potekhin, 1986) with the ORIGIN software 
provided by Microcal. 

15 

Thermal unfolding 

A temperature-induced unfolding experiment on Fn3 was performed using 
circular dichroism (CD) spectroscopy to monitor changes in secondary structure. The 
CD spectrum of the native Fn3 shows a weak signal near 222 nm (Fig. 3 A), consistent 

20 with the predominantly P-structure of Fn3 (Perczel et al., 1992). A cooperative 
unfolding transition is observed at 80-90°C, clearly indicating high stability of Fn3 
(Fig. 3B). The free energy of unfolding could not be determined due to the lack of a 
post-transition baseline. The result is consistent with the high stability of the first Fn3 
domain of human fibronectin (Litvinovich et al, 1992), thus indicating that Fn3 

25 domains are in general highly stable. 

Binding assays 

Binding reaction of FnAbs were characterized quantitatively using an 
isothermal titration calorimeter (ITC) and fluorescence spectroscopy. 
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The enthalpy change (AH) of binding were measured using a Microcal 
OMEGA ITC (Wiseman et al., 1989). The sample cell (~ 1.3 ml) was filled with 
FnAbs solution (< 100 |iM, changed according to KJ, and the reference cell filled 
with distilled water; the system was equilibrated at a given temperature until a stable 
5 baseline is obtained; 5-20 of ligand solution (< 2 mM) was injected by a 

motordriven syringe within a short duration (20 sec) followed by an equilibration 
delay (4 minutes); the injection was repeated and heat generation/absorption for each 
injection was measured. From the change in the observed heat change as a function of 
ligand concentration, AH and K d was determined (Wiseman et al., 1989). AG and AS 

10 of the binding reaction was deduced from the two directly measured parameters. 
Deviation from the theoretical curve was examined to assess nonspecific 
(multiplesite) binding. Experiments were also performed by placing a ligand in the 
cell and titrating with an FnAb. It should be emphasized that only ITC gives direct 
measurement of AH, thereby making it possible to evaluate enthalpic and entropic 

15 contributions to the binding energy. ITC was successfully used to monitor the binding 
reaction of the D1.3 Ab (Tello et al., 1993; Bhat et al, 1994). 

Intrinsic fluorescence is monitored to measure binding reactions with K d in the 
sub-|iM range where the determination of K d by ITC is difficult. Tip fluorescence 
(excitation at ~ 290 nm, emission at 300-350 nm) and Tyr fluorescence (excitation at 

20 ~ 260 nm, emission at ~ 303 nm) is monitored as the Fn3-mutant solution (^ 10 \iM) 
is titrated with ligand solution (<; 100 \iM). K d of the reaction is determined by the 
nonlinear least-squares fitting of the bimolecular binding equation. Presence of 
secondary binding sites is examined using Scatchard analysis. In all binding assays, 
control experiments are performed busing wild-type Fn3 (or unrelated FnAbs) in place 

25 of FnAbs of interest. 

II. Production of Fn3 mutants with high affinity and specificity FnAbs 

Library screening was carried out in order to select FnAbs which bind to 
specific ligands. This is complementary to the modeling approach described above. 
30 The advantage of combinatorial screening is that one can easily produce and screen a 




large number of variants 10 ), which is not feasible with specific mutagenesis 
("rational design") approaches. The phage display technique (Smith, 1985; O'Neil & 
Hoess, 1995) was used to effect the screening processes. Fn3 was fused to a phage 
coat protein (pHI) and displayed on the surface of filamentous phages. These phages 
harbor a single-stranded DNA genome that contains the gene coding the Fn3 fusion 
protein. The amino acid sequence of defined regions of Fn3 were randomized using a 
degenerate nucleotide sequence, thereby constructing a library. Phages displaying 
Fn3 mutants with desired binding capabilities were selected in vitro, recovered and 
amplified. The amino acid sequence of a selected clone can be identified readily by 
sequencing the Fn3 gene of the selected phage. The protocols of Smith (Smith & 
Scott, 1993) were followed with minor modifications. 

The objective was to produce FnAbs which have high affinity to small protein 
ligands. HEL and the Bl domain of staphylococcal protein G (hereafter referred to as 
protein G) were used as ligands. Protein G is small (56 amino acids) and highly stable 
(Minor & Kim, 1994; Smith et al, 1994). Its structure was determined by NMR 
spectroscopy (Gronenbom et al., 1991) to be a helix packed against a four-strand 
P-sheet. The resulting FnAb-protein G complexes (~ 150 residues) is one of the 
smallest protein-protein complexes produced to date, well within the range of direct 
NMR methods. The small size, the high stability and solubility of both components 
and the ability to label each with stable isotopes ( 13 C and 15 N; see below for protein G) 
make the complexes an ideal model system for NMR studies on protein-protein 
interactions. 

The successful loop replacement of Fn3 (the mutant D1.34) demonstrate that 
at least ten residues can be mutated without the loss of the global fold. Based on this, 
a library was first constructed in which only residues in the FG loop are randomized. 
After results of loop replacement experiments on the BC loop were obtained, mutation 
sites were extended that include the BC loop and other sites. 

Construction of Fn3 phage display system 
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An Ml 3 phage-based expression vector pASMl has been constructed as 
follows: an oligonucleotide coding the signal peptide of OmpT was cloned at the 5 ! 
end of the Fn3 gene; a gene fragment coding the C-terminal domain of Ml 3 pill was 
prepared from the wild-type gene III gene of Ml 3 mp 18 using PCR (Corey et al., 
1993) and the fragment was inserted at the 3' end of the OmpT-Fn3 gene; a spacer 
sequence has been inserted between Fn3 and pill. The resultant fragment 
(OmpTFn3-pIII) was cloned in the multiple cloning site of Ml 3 mpl8, where the 
fusion gene is under the control of the lac promoter. This system will produce the 
Fn3-pIII fusion protein as well as the wild-type pill protein. The co-expression of 
wild-type pill is expected to reduce the number of fusion pill protein, thereby 
increasing the phage infectivity (Corey et al., 1993) (five copies of pill are present on 
a phage particle). In addition, a smaller number of fusion pill protein may be 
advantageous in selecting tight binding proteins, because the chelating effect due to 
multiple binding sites should be smaller than that with all five copies of fusion pill 
(Bass et al., 1990). This system has successfully displayed the serine protease trypsin 
(Corey et al., 1993). Phages were produced and purified using E. coli K91kan (Smith 
& Scott, 1993) according to a standard method (Sambrook et al., 1989) except that 
phage particles were purified by a second polyethylene glycol precipitation and acid 
precipitation. 

Successful display of Fn3 on fusion phages has been confirmed by ELISA 
using an Ab against fibronectin (Sigma), clearly indicating that it is feasible to 
construct libraries using this system. 

An alternative system using the fUSE5 (Parmley & Smith, 1988) may also be 
used. The Fn3 gene is inserted to fUSE5 using the Sfil restriction sites introduced at 
the 5'- and 3'- ends of the Fn3 gene PCR. This system displays only the fusion pill 
protein (up to five copies) on the surface of a phage. Phages are produced and 
purified as described (Smith & Scott, 1993). This system has been used to display 
many proteins and is robust. The advantage of fUSE5 is its low toxicity. This is due 
to the low copy number of the replication form (RF) in the host, which in turn makes 
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it difficult to prepare a sufficient amount of RF for library construction (Smith & 
Scott, 1993). 

Construction of libraries 

5 The first library was constructed of the Fn3 domain displayed on the surface of 

MB phage in which seven residues (77-83) in the FG loop (Fig. 4D) were randomized. 
Randomization will be achieved by the use of an oligonucleotide containing 
degenerated nucleotide sequence. A double-stranded nucleotide was prepared by the 
same protocol as for gene synthesis (see above) except that one strand had an 

10 (NNK) 6 (TSfNG) sequence at the mutation sites, where N corresponds to an equimolar 
mixture of A, T, G and C and K corresponds to an equimolar mixture of G and T. The 
(NNG) codon at residue 83 was required to conserve the SacI restriction site (Fig. 2). 
The (NNK) codon codes all of the 20 amino acids, while the NNG codon codes 14. 
Therefore, this library contained ~ 10 9 independent sequences. The library was 

1 5 constructed by ligating the double-stranded nucleotide into the wild-type phage 
vector, pASMl, and the transfecting E. coli XL1 blue (Stratagene) using 
electroporation. XL1 blue has the lacl q phenotype and thus suppresses the expression 
of the Fn3-pIII fusion protein in the absence of lac inducers. The initial library was 
propagated in this way, to avoid selection against toxic Fn3-pIII clones. Phages 

20 displaying the randomized Fn3-pIII fusion protein were prepared by propagating 
phages with K91kan as the host. K91kan does not suppress the production of the 
fusion protein, because it does not have lacl q . Another library was also generated in 
which the BC loop (residues 26-20) was randomized. 

25 Selection of displayed Fn Abs 

Screening of Fn3 phage libraries was performed using the biopanning protocol 
(Smith & Scott, 1993); a ligand is biotinylated and the strong biotinstreptavidin 
interaction was used to immobilize the ligand on a streptavidin-coated dish. 
Experiments were performed at room temperature (~22 °C). For the initial recovery of 
30 phages from a library, 10 |ig of a biotinylated ligand were immobilized on a 
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streptavidin-coated polystyrene dish (35 mm, Falcon 1008) and then a phage solution 
(containing ~ 10 11 pfu (plaque-forming unit)) was added. After washing the dish with 
an appropriate buffer (typically TBST, Tris-HCl (50 mM, pH 7.5), NaCl (150 mM) 
and Tween 20 (0.5%)), bound phages were eluted by one or combinations of the 
5 following conditions: low pH, an addition of a free ligand, urea (up to 6 M) and, in the 
case of anti-protein G FnAbs, cleaving the protein G-biotin linker by thrombin. 
Recovered phages were amplified using the standard protocol using K91kan as the 
host (Sambrook et al, 1989). The selection process were repeated 3-5 times to 
concentrate positive clones. From the second round on, the amount of the ligand were 
10 gradually decreased (to ~ 1 jxg) and the biotinylated ligand were mixed with a phage 
solution before transferring a dish (G. P. Smith, personal communication). After the 
final round, 10-20 clones were picked, and their DNA sequence will be determined. 
The ligand affinity of the clones were measured first by the phage-ELISA method (see 
below). 

15 To suppress potential binding of the Fn3 framework (background binding) to a 

ligand, wild-type Fn3 may be added as a competitor in the buffers. In addition, 
unrelated proteins (e.g., bovine serum albumin, cytochrome c and RNase A) may be 
used as competitors to select highly specific FnAbs. 

20 Binding assay 

The binding affinity of FnAbs on phage surface is characterized 
semiquantitatively using the phage ELISA technique (Li et al., 1995). Wells of 
microtiter plates (Nunc) are coated with a ligand protein (or with streptavidin 
followed by the binding of a biotinylated ligand) and blocked with the BLOTTO 

25 solution (Pierce). Purified phages (~ 10 10 pfu) originating from single plaques 

(M13)/colonies (fUSE5) are added to each well and incubated overnight at 4°C. After 
washing wells with an appropriate buffer (see above), bound phages are detected by 
the standard ELISA protocol using anti-M13 Ab (rabbit, Sigma) and anti-rabbit 
Ig-peroxidase conjugate (Pierce) or using anti-M13 Ab-peroxidase conjugate 

30 (Pharmacia). Colormetric assays are performed using TMB 
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(3,3^5, 5'-tetramethylbenzidine, Pierce). The high affinity of protein G to 
immunoglobulins present a special problem; Abs cannot be used in detection. 
Therefore, to detect anti-protein G FnAbs, fusion phages are immobilized in wells and 
the binding is then measured using biotinylated protein G followed by the detection 
5 using streptavidin-peroxidase conjugate. 

Production of soluble FnAbs 

After preliminary characterization of mutant Fn3s using phage ELISA, mutant 
genes are subcloned into the expression vector pEWl. Mutant proteins are produced 

10 as His»tag fusion proteins and purified, and their conformation, stability and ligand 
affinity are characterized. 

Thus, Fn3 is the fourth example of a monomelic immunoglobulin-like scaffold 
that can be used for engineering binding proteins. Successful selection of novel 
binding proteins have also been based on minibody, tendamistat and "camelized" 

15 immunoglobulin VH domain scaffolds (Martin et al., 1994; Davies & Riechmann, 
1995; McConnell & Hoess, 1995). The Fn3 scaffold has advantages over these 
systems. Bianchi et al. reported that the stability of a minibody was 2.5 kcal/mol, 
significantly lower than that of Ubi4-K. No detailed structural characterization of 
minibodies has been reported to date. Tendamistat and the VH domain contain 

20 disulfide bonds, and thus preparation of correctly folded proteins may be difficult. 
Davies and Riechmann reported that the yields of their camelized VH domains were 
less than 1 mg per liter culture (Davies & Riechmann, 1996). 

Thus, the Fn3 framework can be used as a scaffold for molecular recognition. 
Its small size, stability and well-characterized structure make Fn3 an attractive system. 

25 In light of the ubiquitous presence of Fn3 in a wide variety of natural proteins 

involved in ligand binding, one can engineer Fn3-based binding proteins to different 
classes of targets. 

The following examples are intended to illustrate but not limit the invention. 
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EXAMPLE I 



Construction of the Fn3 gene 



10 



15 



20 



A synthetic gene for tenth Fn3 of fibronectin (Fig. 1) was designed on the 
basis of amino acid residue 1416-1509 of human fibronectin (Kornblihtt, et al 9 1985) 
and its three dimensional structure (Main, et al. 9 1992). The gene was engineered to 
include convenient restriction sites for mutagenesis and the so-called "preferred 
codons" for high level protein expression (Gribskov, et al 9 1984) were used. In 
addition, a glutamine residue was inserted after the N-terminal methionine in order to 
avoid partial processing of the N-terminal methionine which often degrades NMR 
spectra (Smith, et al., 1994). Chemical reagents were of the analytical grade or better 
and purchased from Sigma Chemical Company and J.T. Baker, unless otherwise 
noted. Recombinant DNA procedures were performed as described in "Molecular 
Cloning" (Sambrook, et ai, 1989), unless otherwise stated. Custom oligonucleotides 
were purchased from Operon Technologies. Restriction and modification enzymes 
were from New England Biolabs. 

The gene was assembled in the following manner. First, the gene sequence 
(Fig. 5) was divided into five parts with boundaries at designed restriction sites: 
fragment 1, Ndel-PstI (oligonucleotides FN IF and FN1R (Table 2); fragment 2, 
Pstl-EcoRI (FN2F and FN2R); fragment 3, EcoRI-Sall (FN3F and FN3R); fragment 
4, Sall-SacI (FN4F and FN4R); fragment 5, SacI-BamHI (FN5F and FN5R). Second, 
for each part, a pair of oligonucleotides which code opposite strands and have 
complementary overlaps of approximately 15 bases was synthesized. These 
oligonucleotides were designated FN1F-FN5R and are shown in Table 2. Third, each 
pair (e.g., FN1F and FN1R) was annealed and single-strand regions were filled in 
using the Klenow fragment of DNA polymerase. Fourth, the double stranded 
oligonucleotide was digested with the relevant restriction enzymes at the termini of 
the fragment and cloned into the PBLUESCRIPT SK plasmid (Stratagene) which had 
been digested with the same enzymes as those used for the fragments. The DNA 
sequence of the inserted fragment was confirmed by DNA sequencing using an 




Applied Biosystems DNA sequencer and the dideoxy termination protocol provided 
by the manufacturer. Last, steps 2-4 were repeated to obtain the entire gene. 

The gene was also cloned into the pET3a and pET15b (Novagen) vectors 
(pAS45 and pAS25, respectively). The maps of the plasmids are shown in Figs. 6 
5 and 7. E. coli BL21 (DE3) (Novagen) containing these vectors expressed the Fn3 
gene under the control of bacteriophage T7 promotor (Studier, et al., 1990); pAS24 
expresses the 96-residue Fn3 protein only, while pAS45 expresses Fn3 as a fusion 
protein with poly-histidine peptide (His»tag). High level expression of the Fn3 
protein and its derivatives in E. coli was detected as an intense band on SDS-PAGE 
10 stained with CBB. 

The binding reaction of the monobodies is characterized quantitatively by 
means of fluorescence spectroscopy using purified soluble monobodies. 

Intrinsic fluorescence is monitored to measure binding reactions. Trp 
fluorescence (excitation at -290 nm, emission at 300 350 nm) and Tyr fluorescence 
15 (excitation at -260 nm, emission at -303 nm) is monitored as the Fn3 -mutant solution 
(< 100 [iM) is titrated with a ligand solution. When a ligand is fluorescent (e.g. 
fluorescein), fluorescence from the ligand may be used. K d of the reaction will be 
determined by the nonlinear least-squares fitting of the bimolecular binding equation. 

If intrinsic fluorescence cannot be used to monitor the binding reaction, 
20 monobodies are labeled with fluorescein-NHS (Pierce) and fluorescence polarization 
is used to monitor the binding reaction (Burke et al, 1996). 

EXAMPLE II 
Modifications to include restriction sites in the Fn3 gene 

25 The restriction sites were incorporated in the synthetic Fn3 gene without 

changing the amino acid sequence Fn3. The positions of the restriction sites were 
chosen so that the gene construction could be completed without synthesizing long 
(>60 bases) oligonucleotides and so that two loop regions could be mutated (including 
by randomization) by the cassette mutagenesis method (i.e., swapping a fragment with 

30 another synthetic fragment containing mutations). In addition, the restriction sites 
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were chosen so that most sites were unique in the vector for phage display. Unique 
restriction sites allow one to recombine monobody clones which have been already 
selected in order to supply a larger sequence space. 



5 EXAMPLE III 

Construction of M13 phage display libraries 

A vector for phage display, pAS38 (for its map, see Fig. 8) was constructed as 
follows. The Xbal-BamHI fragment of pET12a encoding the signal peptide of OmpT 
was cloned at the 5 1 end of the Fn3 gene. The C-terminal region (from the FN5F and 

10 FN5R' oligonucleotides, see Table 2) of the Fn3 gene was replaced with a new 
fragment consisting of the FN5F and FN5R' oligonucleotides (Table 2) which 
introduced a Mlul site and a linker sequence for making a fusion protein with the pill 
protein of bacteriophage Ml 3. A gene fragment coding the C-terminal domain of 
Ml 3 pill was prepared from the wild-type gene III of M13mpl8 using PCR (Corey, et 

15 aL, 1993) and the fragment was inserted at the 3* end of the OmpT-Fn3 fusion gene 
using the Mlul and Hindlll sites. 

Phages were produced and purified using a helper phage, M13K07, according 
to a standard method (Sambrook, et al, 1989) except that phage particles were 
purified by a second polyethylene glycol precipitation. Successful display of Fn3 on 

20 fusion phages was confirmed by ELISA (Harlow & Lane, 1988) using an antibody 
against fibronectin (Sigma) and a custom anti-FN3 antibody (Cocalico Biologicals, 
PA, USA). 



EXAMPLE IV 

25 Libraries containing loop variegations in the AB loop 

A nucleic acid phage display library having variegation in the AB loop is 
prepared by the following methods. Randomization is achieved by the use of 
oligonucleotides containing degenerated nucleotide sequence. Residues to be 
variegated are identified by examining the X-ray and NMR structures of Fn3 (Protein 
30 Data Bank accession numbers, 1FNA and 1TTF, respectively). Oligonucleotides 
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containing NNK (N and K here denote an equimolar mixture of A, T, G, and C and an 
equimolar mixture of G and T 5 respectively) for the variegated residues are 
synthesized (see oligonucleotides BC3, FG2, FG3, and FG4 in Table 2 for example). 
The NNK mixture codes for all twenty amino acids and one termination codon 
5 (TAG). TAG, however, is suppressed in the E. coli XL-1 blue. Single-stranded 
DNAs of pAS38 (and its derivatives) are prepared using a standard protocol 
(Sambrook, etal 9 1989). 

Site-directed mutagenesis is performed following published methods (see for 
example, Kunkel, 1985) using a MUTA-GENE kit (BioRad). The libraries are 

10 constructed by electroporation of E. coli XL-1 Blue electroporation competent cells 
(200 (xl; Stratagene) with l|ig of the plasmid DNA using a BTX electrocell 
manipulator ECM 395 1mm gap cuvette. A portion of the transformed cells is plated 
on an LB-agar plate containing ampicillin (100 |ig/ml) to determine the transformation 
efficiency. Typically, 3 X 10 8 transformants are obtained with 1 jag of DNA, and thus 

15 a library contains 10 8 to 10 9 independent clones. Phagemid particles were prepared as 
described above. 



EXAMPLE V 
Loop variegations in the BC, CD, DE, EF or FG loop 

20 A nucleic acid phage display library having five variegated residues (residues 

number 26-30) in the BC loop, and one having seven variegated residues (residue 
numbers 78-84) in the FG loop, was prepared using the methods described in Example 
IV above. Other nucleic acid phage display libraries having variegation in the CD, 
DE or EF loop can be prepared by similar methods. 

25 

EXAMPLE VI 
Loop variegations in the FG and BC loop 

A nucleic acid phage display library having seven variegated residues 
(residues number 78-84) in the FG loop and five variegated residues (residue number 
30 26-30) in the BC loop was prepared. Variegations in the BC loop were prepared by 
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site-directed mutagenesis (Kunkel, et al) using the BC3 oligonucleotide described in 
Table 1. Variegations in the FG loop were introduced using site-directed mutagenesis 
using the BC loop library as the starting material, thereby resulting in libraries 
containing variegations in both BC and FG loops. The oligonucleotide FG2 has 
variegating residues 78-84 and oligonucleotide FG4 has variegating residues 77-81 
and a deletion of residues 82-84. 

A nucleic acid phage display library having five variegated residues (residues 
78-84) in the FG loop and a three residue deletion (residues 82-84) in the FG loop, 
and five variegated residues (residues 26-30) in the BC loop, was prepared The 
shorter FG loop was made in an attempt to reduce the flexibility of the FG loop; the 
loop was shown to be highly flexible in Fn3 by the NMR studies of Main, et al 
(1992). A highly flexible loop may be disadvantageous to forming a binding site with 
a high affinity (a large entropy loss is expected upon the ligand binding, because the 
flexible loop should become more rigid). In addition, other Fn3 domains (besides 
human) have shorter FG loops (for sequence alignment, see Figure 12 in Dickinson, et 
al (1994)). 

Randomization was achieved by the use of oligonucleotides containing 
degenerate nucleotide sequence (oligonucleotide BC3 for variegating the BC loop and 
oligonucleotides FG2 and FG4 for variegating the FG loops). 

Site-directed mutagenesis was performed following published methods (see 
for example, Kunkel, 1985). The libraries were constructed by electro transforming E. 
coli XL-1 Blue (Stratagene). Typically a library contains 10 8 to 10 9 independent 
clones. Library 2 contains five variegated residues in the BC loop and seven 
variegated residues in the FG loop. Library 4 contains five variegated residues in each 
of the BC and FG loops, and the length of the FG loop was shortened by three 
residues. 
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EXAMPLE VII 
fd phage display libraries constructed with loop variegations 




Phage display libraries are constructed using the fd phage as the genetic 
vector. The Fn3 gene is inserted in fUSE5 (Parmley & Smith, 1988) using Sfll 
restriction sites which are introduced at the 5' and 3' ends of the Fn3 gene using PCR. 
The expression of this phage results in the display of the fusion pill protein on the 
5 surface of the fd phage. Variegations in the Fn3 loops are introduced using 

site-directed mutagenesis as described hereinabove, or by subcloning the Fn3 libraries 
constructed in Ml 3 phage into the fUSE5 vector. 



EXAMPLE VIII 
10 Other phage display libraries 

T7 phage libraries (Novagen, Madison, WI) and bacterial pili expression 
systems (Invitrogen) are also useful to express the Fn3 gene. 



EXAMPLE IX 

15 Isolation of polypeptides which bind to macromolecular structures 

The selection of phage-displayed monobodies was performed following the 
protocols of Barbas and coworkers (Rosenblum & Barbas, 1995). Briefly, 
approximately 1 \ig of a target molecule ("antigen") in sodium carbonate buffer (100 
mM, pH 8.5) was immobilized in the wells of a microtiter plate (Maxisorp, Nunc) by 

20 incubating overnight at 4° C in an air tight container. After the removal of this 

solution, the wells were then blocked with a 3% solution of BSA (Sigma, Fraction V) 
in TBS by incubating the plate at 37°C for 1 hour. A phagemid library solution (50 
|il) containing approximately 10 12 colony forming units (cfu) of phagemid was 
absorbed in each well at 37*C for 1 hour. The wells were then washed with an 

25 appropriate buffer (typically TBST, 50 mM Tris-HCl (pH 7.5), 1 50 mM NaCl , and 
0.5% Tween20) three times (once for the first round). Bound phage were eluted by an 
acidic solution (typically, 0.1 M glycine-HCl, pH 2.2; 50 |il) and recovered phage 
were immediately neutralized with 3 \i\ of Tris solution. Alternatively, bound phage 
were eluted by incubating the wells with 50 |il of TBS containing the antigen (1-10 

30 |iM). Recovered phage were amplified using the standard protocol employing the 
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XLlBlue cells as the host (Sambrook, et ah). The selection process was repeated 5-6 
times to concentrate positive clones. After the final round, individual clones were 
picked and their binding affinities and DNA sequences were determined. 

The binding affinities of monobodies on the phage surface were characterized 
using the phage ELISA technique (Li, et al 9 1995). Wells of microtiter plates (Nunc) 
were coated with an antigen and blocked with BSA Purified phages (10 8 - 10 11 cfu) 
originating from a single colony were added to each well and incubated 2 hours at 37 
°C. After washing wells with an appropriate buffer (see above), bound phage were 
detected by the standard ELISA protocol using antiM13 antibody (rabbit, Sigma) and 
anti-rabbit Ig-peroxidase conjugate (Pierce). Colorimetric assays were performed 
using Turbo-TMB (3,3',5,5 f -tetramethylbenzidine, Pierce) as a substrate. 

The binding affinities of monobodies on the phage surface were further 
characterized using the competition ELISA method (Djavadi-Ohaniance, et ai, 1996). 
In this experiment, phage ELISA is performed in the same manner as described above, 
except that the phage solution contains a ligand at varied concentrations. The phage 
solution was incubated a 4°C for one hour prior to the binding of an immobilized 
ligand in a microtiter plate well. The affinities of phage displayed monobodies are 
estimated by the decrease in ELISA signal as the free ligand concentration is 
increased. 

After preliminary characterization of monobodies displayed on the surface of 
phage using phage ELISA, genes for positive clones were subcloned into the 
expression vector pAS45. E. coli BL21(DE3) (Novagen) was transformed with an 
expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
medium and M9 medium supplemented with Bactotryptone (Difco) containing 
ampicillin (200 |ig/ml). For isotopic labeling, I5 N NH 4 C1 and/or 13 C glucose replaced 
unlabeled components. Stable isotopes were purchased from Isotec and Cambridge 
Isotope Labs. 500 ml medium in a 2 1 baffle flask was inoculated with 10 ml of 
overnight culture and agitated at approximately 140 rpm at 37°C. IPTG was added at 
a final concentration of 1 mM to induce protein expression when OD(600 nm) reached 
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approximately 1 .0. The cells were harvested by centrifugation 3 hours after the 
addition of IPTG and kept frozen at -70°C until used. 

Fn3 and monobodies with His»tag were purified as follows. Cells were 
suspended in 5 ml/(g cell) of 50 mM Tris (pH 7.6) containing 1 mM 
phenylmethylsulfonyl fluoride. HEL (Sigma, 3X crystallized) was added to a final 
concentration of 0.5 mg/ml. After incubating the solution for 30 min at 37 0 C, it was 
sonicated so as to cause cell breakage three times for 30 seconds on ice. Cell debris 
was removed by centrifugation at 15,000 rpm in an Sorval RC-2B centrifuge using an 
SS-34 rotor. Concentrated sodium chloride is added to the solution to a final 
concentration of 0.5 M. The solution was then applied to a 1 ml HISTRAP™ 
chelating column (Pharmacia) preloaded with nickel chloride (0.1 M, 1 ml) and 
equilibrated in the Tris buffer (50 mM, pH 8.0) containing 0.5 M sodium chloride. 
After washing the column with the buffer, the bound protein was eluted with a Tris 
buffer (50 mM, pH 8.0) containing 0.5 M imidazole. The His«tag portion was cleaved 
off, when required, by treating the fusion protein with thrombin using the protocol 
supplied by Novagen (Madison, WI). Fn3 was separated from the His»tag peptide and 
thrombin by a RESOURCES® column (Pharmacia) using a linear gradient of sodium 
chloride (0-0.5 M) in sodium acetate buffer (20 mM, pH 5.0). 

Small amounts of soluble monobodies were prepared as follows. XL-1 Blue 
cells containing pAS38 derivatives (plasmids coding Fn3-pIII fusion proteins) were 
grown in LB media at 37°C with vigorous shaking until OD(600 nm) reached 
approximately 1.0; IPTG was added to the culture to a final concentration of 1 mM, 
and the cells were further grown overnight at 37°C Cells were removed from the 
medium by centrifugation, and the supernatant was applied to a microtiter well coated 
with a ligand. Although XL-1 Blue cells containing pAS38 and its derivatives express 
FN3-pIII fusion proteins, soluble proteins are also produced due to the cleavage of the 
linker between the Fn3 and pill regions by proteolytic activities of E. coli (Rosenblum 
& Barbas, 1995). Binding of a monobody to the ligand was examined by the standard 
ELIS A protocol using a custom antibody against Fn3 (purchased from Cocalico 
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Biologicals, Reamstown, PA). Soluble monobodies obtained from the periplasmic 
fraction of E. coli cells using a standard osmotic shock method were also used. 

EXAMPLE X 

5 Ubiquitin binding monobody 

Ubiquitin is a small (76 residue) protein involved in the degradation pathway 
in eurkaryotes. It is a single domain globular protein. Yeast ubiquitin was purchased 
from Sigma Chemical Company and was used without further purification. 

Libraries 2 and 4, described in Example VI above, were used to select 

10 ubiquitin-binding monobodies. Ubiquitin (1 |ig in 50 |il sodium bicarbonate buffer 
(100 mM, pH 8.5)) was immobilized in the wells of a microtiter plate, followed by 
blocking with BSA (3% in TBS). Panning was performed as described above. In the 
first two rounds, 1 |ig of ubiquitin was immobilized per well, and bound phage were 
elute with an acidic solution. From the third to the sixth rounds, 0.1 \ig of ubiquitin 

15 was immobilized per well and the phage were eluted either with an acidic solution or 
with TBS containing 10 \iM ubiquitin. 

Binding of selected clones was tested first in the polyclonal mode, i.e., before 
isolating individual clones. Selected clones from all libraries showed significant 
binding to ubiquitin. These results are shown in Figure 9. The binding to the 

20 immobilized ubiquitin of the clones was inhibited almost completely by less than 30 
|im soluble ubiquitin in the competition ELISA experiments (see Fig. 10). The 
sequences of the BC and FG loops of ubiquitin-binding monobodies is shown in Table 
3. 
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Table 3. Sequences of ubiquitin-binding monobodies 



10 



Occurrence 



Name 


BC loop 


FG loop 


(if more 








than one) 


211 


CARRA (SEQ ID NO:31) 


RWIPLAK (SEQ ID NO:32) 


2 


212 


CWRRA (SEQ ID NO:33) 


RWVGLAW (SEQ ID NO:34) 




213 


CKHRR(SEQIDNO:35) 


FADLWWR (SEQ ID NO:36) 




214 


CRRGR (SEQ ID NO:37) 


RGFMWLS (SEQ ID NO:38) 




215 


CNWRR (SEQ ID NO:39) 


RAYRYRW (SEQ ID NO:40) 




411 


SRLRR(SEQIDNO:41) 


PPWRV(SEQIDNO:42) 


9 


422 


ARWTL (SEQ ID NO:43) 


RRWWW (SEQ ID NO:44) 




424 


GQRTF (SEQ ID NO:45) 


RRWWA (SEQ ID NO:46) 





The 411 clone, which was the most enriched clone, was characterized using 
15 phage ELISA. The 411 clone showed selective binding and inhibition of binding in 
the presence of about 10 \xM ubiquitin in solution (Fig. 1 1). 



EXAMPLE XI 
Methods for the immobilization of small molecules 

20 Target molecules were immobilized in wells of a microtiter plate 

(MAXISORP, Nunc) as described hereinbelow, and the wells were blocked with 
BSA. In addition to the use of carrier protein as described below, a conjugate of a 
target molecule in biotin can be made. The biotinylated ligand can then be 
immobilized to a microtiter plate well which has been coated with streptavidin. 

25 In addition to the use of a carrier protein as described below, one could make a 

conjugate of a target molecule and biotin (Pierce) and immobilize a biotinylated 
ligand to a microtiter plate well which has been coated with streptavidin (Smith and 
Scott, 1993). 
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Small molecules may be conjugated with a carrier protein such as bovine 
serum albumin (BSA, Sigma), and passively adsorbed to the microtiter plate well. 
Alternatively, methods of chemical conjugation can also be used. In addition, solid 
supports other than microtiter plates can readily be employed. 



Fluorescein has been used as a target for the selection of antibodies from 

combinatorial libraries (Barbas, et al 1992). NHS-fluorescein was obtained from 
10 Pierce and used according to the manufacturer's instructions in preparing conjugates 

with BSA (Sigma). Two types of fluorescein-BSA conjugates were prepared with 

approximate molar ratios of 17 (fluorescein) to one (BSA). 

The selection process was repeated 5-6 times to concentrate positive clones. 

In this experiment, the phage library was incubated with a protein mixture (BSA, 
15 cytochrome C (Sigma, Horse) and RNaseA (Sigma, Bovine), 1 mg/ml each) at room 

temperature for 30 minutes, prior to the addition to ligand coated wells. Bound phage 

were eluted in TBS containing 10 |iM soluble fluorescein, instead of acid elution. 

After the final round, individual clones were picked and their binding affinities (see 

below) and DNA sequences were determined. 
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EXAMPLE XII 



Fluorescein binding monobody 



20 
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Table 4. Clones from Library #2 







BC 


EG 




WT 


AVTVR (SEQ ID NO:47) 


RGDSPAS (SEQ ID NO:48) 


5 


pLB24.1 


CNWRR (SEQ ID NO:49) 


RAYRYRW (SEQ ID NO:50) 




pLB24.2 


CMWRA (SEQIDNO:51) 


RWGMLRR (SEQ ID NO: 52) 




pLB24.3 


ARMRE (SEQ ID NO:53) 


RWLRGRY (SEQ ID NO:54) 




pLB24.4 


CARRR (SEQ ID NO:55) 


RRAGWGW (SEQ ID NO:56) 




pLB24.5 


CNWRR (SEQ ID NO:57) 


RAYRYRW (SEQ ID NO:58) 


10 


pLB24.6 


RWRER(SEQIDNO:59) 


RHPWTER (SEQ ID NO:60) 




pLB24.7 


CNWRR (SEQIDNO:61) 


RAYRYRW (SEQ ID NO:62) 




pLB24.8 


ERR VP (SEQ ID NO:63) 


RLLLWQR (SEQ ID NO:64) 




pLB24.9 


GRGAG (SEQ ID NO:65) 


FGSFERR (SEQ ID NO:66) 




pLB24.11 


CRWTR (SEQ ID NO:67) 


RRWFDGA (SEQ ID NO:68) 


15 


pLB 24.12 


CNWRR (SEQ ID NO:69) 


RAYRYRW (SEQ ID NO:70) 






Clones from Library #4 




WT 


AVTVR (SEQ ID NO:71) 


GRGDS (SEQ ID NO:72) 


20 


pLB25.1 


GQRTF (SEQ ID NO:73) 


RRWWA (SEQ ID NO:74) 




pLB25.2 


GQRTF (SEQ ID NO:75) 


RRWWA (SEQ ID NO:76) 




pLB25.3 


GQRTF (SEQ ID NO:77) 


RRWWA (SEQ ID NO:78) 






LRYRS (SEQ ID NO:79) 


GWRWR (SEQ ID NO:80) 




pLB25.5 


GQRTF (SEQ ID NO:81) 


RRWWA (SEQ ID NO:82) 


25 


pLB25.6 


GQRTF (SEQ ID NO:83) 


RRWWA (SEQ ID NO:84) 




pLB25.7 


LRYRS (SEQ ID NO:85) 


GWRWR (SEQ ID NO:86) 




pLB25.9 


LRYRS (SEQ ID NO:87) 


GWRWR (SEQ ID NO:88) 
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pLB25.1 1 GQRTF (SEQ ID NO:89) RRWWA (SEQ ID NO:90) 

pLB25.12 LRYRS (SEQ ID N0:91) GWRWR (SEQ ID NO:92) 



Preliminary characterization of the binding affinities of selected clones were 
5 performed using phage ELISA and competition phage ELISA (see Fig. 12 

(Fluorescein- 1) and Fig. 13 (Fluorescein-2)). The four clones tested showed specific 
binding to the ligand-coated wells, and the binding reactions are inhibited by soluble 
fluorescein (see Fig. 1 3). 



10 EXAMPLE XIII 

Digoxigenin binding monobody 

Digoxigenin-3 -O-methy 1-carbonyl-e-aminocapronic acid-NHS (Boehringer 
Mannheim) is used to prepare a digoxigenin-BSA conjugate. The coupling reaction is 
performed following the manufacturers 1 instructions. The digoxigenin-BSA conjugate 
15 is immobilized in the wells of a microtiter plate and used for panning. Panning is 
repeated 5 to 6 times to enrich binding clones. Because digoxigenin is sparingly 
soluble in aqueous solution, bound phages are eluted from the well using acidic 
solution. See Example XIV. 



20 EXAMPLE XIV 

TSAC (transition state analog compound) binding monobodies 

Carbonate hydrolyzing monobodies are selected as follows. A transition state 
analog for carbonate hydrolysis, 4-nitrophenyl phosphonate is synthesized by an 
Arbuzov reaction as described previously (Jacobs and Schultz, 1987). The 
25 phosphonate is then coupled to the carrier protein, BSA, using carbodiimide, followed 
by exhaustive dialysis (Jacobs and Schultz, 1987). The hapten-BSA conjugate is 
immobilized in the wells of a microtiter plate and monobody selection is performed as 
described above. Catalytic activities of selected monobodies are tested using 
4-nitrophenyl carbonate as the substrate. 
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Other haptens useful to produce catalytic monobodies are summarized in H. 
Suzuki (1994) and in N. R. Thomas (1994). 

EXAMPLE XV 

5 NMR characterization of Fn3 and comparison of the Fn3 

secreted by yeast with that secreted by E. coli 

Nuclear magnetic resonance (NMR) experiments are performed to identify the 
contact surface between FnAb and a target molecule, e.g., monobodies to fluorescein, 
ubiquitin, RNaseA and soluble derivatives of digoxigenin. The information is then be 

10 used to improve the affinity and specificity of the monobody. Purified monobody 
samples are dissolved in an appropriate buffer for NMR spectroscopy using Amicon 
ultrafiltration cell with a YM-3 membrane. Buffers are made with 90 % H 2 O/10 % 
D 2 0 (distilled grade, Isotec) or with 100 % D 2 0. Deuterated compounds (e.g. acetate) 
are used to eliminate strong signals from them. 

1 5 NMR experiments are performed on a Varian Unity INOVA 600 spectrometer 

equipped with four RF channels and a triple resonance probe with pulsed field 
gradient capability. NMR spectra are analyzed using processing programs such as 
FELIX (Molecular Simulations), NMRPIPE, PIPP, and CAPP (Garrett, et aL, 1991; 
Delaglio, et al, 1995) on UNIX workstations. Sequence specific resonance 

20 assignments are made using well-established strategy using a set of triple resonance 
experiments (CBCA(CO)NH and HNCACB) (Grzesiek & Bax, 1992; Wittenkind & 
Mueller, 1993). 

Nuclear Overhauser effect (NOE) is observed between *H nuclei closer than 
approximately 5 A, which allows one to obtain information on interproton distances. 

25 A series of double- and triple-resonance experiments (Table 5; for recent reviews on 
these techniques, see Bax & Grzesiek, 1993 and Kay, 1995) are performed to collect 
distance (i.e. NOE) and dihedral angle (J-coupling) constraints. Isotope-filtered 
experiments are performed to determine resonance assignments of the bound ligand 
and to obtain distance constraints within the ligand and those between FnAb and the 

30 ligand. Details of sequence specific resonance assignments and NOE peak 
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assignments have been described in detail elsewhere (Clore & Gronenborn, 1991; 
Pascal, et al, 1994b; Metzler, et al, 1996). 

Table 5. NMR experiments for structure characterization 



^ Experiment Name Reference 

^ 1. reference spectra 



2D-'H, 15 N-HSQC (Bodenhausen & Ruben, 1 980; Kay, et al , 1 992) 

10 2D-'H, 13 C-HSQC (Bodenhausen & Ruben, 1980; Vuister & Bax, 

1992). 



2. backbone and side chain resonance assignments of l3 C/ 15 N-labeled protein 



15 3D-CBCA(CO)NH 

3D-HNCACB 

3D-C(CO)NH 

3D-H(CCO)NH 

3D-HBHA(CBCACO)NH 
20 3D-HCCH-TOCSY 

3D-HCCH-COSY 

3D-'H, 15 N-TOCSY-HSQC 

2D-HB (CB CDCE)HE 



(Grzesiek & Bax, 1992) 

(Wittenkind & Mueller, 1993) 

(Logan et al, 1992; Grzesiek et al, 1993) 

(Grzesiek & Bax, 1993) 
(Kayefa/., 1993) 
(Ikuraefa/., 1991) 
(Zhang et al, 1994) 
(Yamazaki et al, 1993) 



25 3. resonance assignments of unlabeled ligand 

2D-isotope-filtered 'H-TOCSY 
2D-isotope-filtered 'H-COSY 

2D-isotope-filtered 'H-NOES Y (lkura & Bax, 1 992) 
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4. structural constraints 



within labeled protein 
3D- ! H, ,5 N-NOESY-HSQC 



(Zhang etal, 1994) 



4D- 1 H, I3 C-HMQC-NOESY-HMQC (Vuister etal, 1993) 
5 4D- 1 H, !3 C, 15 N-HSQC-NOESY-HSQC (Muhandiram et al, 1993; Pascal et al, 



within unlabeled ligand 

2D-isotope-filtered 'H-NOESY (Ikura & Bax, 1992) 
interactions between protein and ligand 
10 3D-isotope-filtered % ,5 N-NOESY-HSQC 

3D-isotope-filtered % ,3 C-NOESY-HSQC (Lee etal., 1994) 

5. dihedral angle constraints 

1 5 J-molulated % 15 N-HSQC (Billeter et aL , 1 992) 

3D-HNHB (Archer etal, 1991) 

Backbone *H, 15 N and 13 C resonance assignments for a monobody are 
compared to those for wild-type Fn3 to assess structural changes in the mutant. Once 

20 these data establish that the mutant retains the global structure, structural refinement is 
performed using experimental NOE data. Because the structural difference of a 
monobody is expected to be minor, the wild-type structure can be used as the initial 
model after modifying the amino acid sequence. The mutations are introduced to the 
wild-type structure by interactive molecular modeling, and then the structure is 

25 energy-minimized using a molecular modeling program such as QUANTA (Molecular 
Simulations). Solution structure is refined using cycles of dynamical simulated 
annealing (Nilges et aL, 1988) in the program X-PLOR (Briinger, 1992). Typically, 
an ensemble of fifty structures is calculated. The validity of the refined structures is 
confirmed by calculating a fewer number of structures from randomly generated 

30 initial structures in X-PLOR using the YASAP protocol (Nilges, et aL, 1991). 



1994a) 
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Structure of a monobody-ligand complex is calculated by first refining both 
components individually using intramolecular NOEs, and then docking the two using 
intermolecular NOEs. 

For example, the *H, 15 N-HSQC spectrum for the fluorescein-binding 
5 monobody LB25.5 is shown in Figure 14. The spectrum shows a good dispersion 
(peaks are spread out) indicating that LB25.5 is folded into a globular conformation. 
Further, the spectrum resembles that for the wild-type Fn3, showing that the overall 
structure of LB25.5 is similar to that of Fn3. These results demonstrate that ligand- 
binding monobodies can be obtained without changing the global fold of the Fn3 
10 scaffold. 

Chemical shift perturbation experiments are performed by forming the 
complex between an isotope-labeled FnAb and an unlabeled ligand. The formation of 
a stoichiometric complex is followed by recording the HSQC spectrum. Because 
chemical shift is extremely sensitive to nuclear environment, formation of a complex 

15 usually results in substantial chemical shift changes for resonances of amino acid 
residues in the interface. Isotope-edited NMR experiments (2D HSQC and 3D 
CBCA(CO)NH) are used to identify the resonances that are perturbed in the labeled 
component of the complex; i.e. the monobody. Although the possibility of artifacts 
due to long-range conformational changes must always be considered, substantial 

20 differences for residues clustered on continuous surfaces are most likely to arise from 
direct contacts (Chen et ai, 1993; Gronenborn & Clore, 1993). 

An alternative method for mapping the interaction surface utilizes amide 
hydrogen exchange (HX) measurements. HX rates for each amide proton are 
measured for 15 N labeled monobody both free and complexed with a ligand. Ligand 

25 binding is expected to result in decreased amide HX rates for monobody residues in 
the interface between the two proteins, thus identifying the binding surface. HX rates 
for monobodies in the complex are measured by allowing HX to occur for a variable 
time following transfer of the complex to D 2 0; the complex is dissociated by lowering 
pH and the HSQC spectrum is recorded at low pH where amide HX is slow. Fn3 is 

30 stable and soluble at low pH, satisfying the prerequisite for the 
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experiments. 



EXAMPLE XVI 



10 



15 



20 



25 



Construction and Analysis of Fn3-Display System Specific for Ubiquitin 

An Fn3 -display system was designed and synthesized, ubiquitin-binding 
clones were isolated and a major Fn3 mutant in these clones was biophysically 
characterized. 

Gene construction and phage display of Fn3 was performed as in Examples I 
and II above. The Fn3 -phage pill fusion protein was expressed from a phagemid- 
display vector, while the other components of the Ml 3 phage, including the wildtype 
pill, were produced using a helper phage (Bass et al. 9 1990). Thus, a phage produced 
by this system should contain less than one copy of Fn3 displayed on the surface. The 
surface display of Fn3 on the phage was detected by ELISA using an anti-Fn3 
antibody. Only phages containing the Fn3-pIII fusion vector reacted with the 
antibody. 

After confirming the phage surface to display Fn3, a phage display library of 
Fn3 was constructed as in Example III. Random sequences were introduced in the BC 
and FG loops. In the first library, five residues (77-81) were randomized and three 
residues (82-84) were deleted from the FG loop. The deletion was intended to reduce 
the flexibility and improve the binding affinity of the FG loop. Five residues (26-30) 
were also randomized in the BC loop in order to provide a larger contact surface with 
the target molecule. Thus, the resulting library contains five randomized residues in 
each of the BC and FG loops (Table 6). This library contained approximately 10 8 
independent clones. 

Library Screening 

Library screening was performed using ubiquitin as the target molecule. In 
each round of panning, Fn3 -phages were absorbed to a ubiquitin-coated surface, and 
bound phages were eluted competitively with soluble ubiquitin. The recovery ratio 
improved from 4.3 x 10" 7 in the second round to 4.5 x 10" 6 in the fifth round, 
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suggesting an enrichment of binding clones. After five founds of panning, the amino 
acid sequences of individual clones were determined (Table 6). 
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A clone, dubbed Ubi4, dominated the enriched pool of Fn3 variants. Therefore, 
further investigation was focused on this Ubi4 clone. Ubi4 contains four mutations in 
the BC loop (Arg 30 in the BC loop was conserved) and five mutations and three 
deletions in the FG loop. Thus 13% (12 out of 94) of the residues were altered in 
Ubi4 from the wild-type sequence. 

Figure 15 shows a phage ELISA analysis of Ubi4. The Ubi4 phage binds to 
the target molecule, ubiquitin, with a significant affinity, while a phage displaying the 
wild-type Fn3 domain or a phase with no displayed molecules show little detectable 
binding to ubiquitin (Figure 15a). In addition, the Ubi4 phage showed a somewhat 
elevated level of background binding to the control surface lacking the ubiquitin 
coating. A competition ELISA experiments shows the IC 50 (concentration of the free 
ligand which causes 50% inhibition of binding) of the binding reaction is 
approximately 5 \iM (Fig. 15b). BSA, bovine ribonuclease A and cytochrome C 
show little inhibition of the Ubi4-ubiquitin binding reaction (Figure 15c), indicating 
that the binding reaction of Ubi4 to ubiquitin does result from specific binding. 

Characterization of a Mutant Fn3 Protein 

The expression system yielded 50-100 mg Fn3 protein per liter culture. A 
similar level of protein expression was observed for the Ubi4 clone and other mutant 
Fn3 proteins. 

Ubi4-Fn3 was expressed as an independent protein. Though a majority of 
Ubi4 was expressed in E. coli as a soluble protein, its solubility was found to be 
significantly reduced as compared to that of wild-type Fn3. Ubi4 was soluble up to 
~20 |iM at low pH, with much lower solubility at neutral pH. This solubility was not 
high enough for detailed structural characterization using NMR spectroscopy or X-ray 
crystallography. 

The solubility of the Ubi4 protein was improved by adding a solubility tail, 
GKKGK, as a C-terminal extension. The gene for Ubi4-Fn3 was subcloned into the 
expression vector pAS45 using PCR. The C-terminal solubilization tag, GKKGK, 
was incorporated in this step. E. coli BL21 (DE3) (Novagen) was transformed with 
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the expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
media and M9 media supplemented with Bactotryptone (Difco) containing ampicillin 
(200 |ig/ml). For isotopic labeling, 15 N NH 4 C1 replaced unlabeled NH 4 C1 in the 
media. 500 ml medium in a 2 liter baffle flask was inoculated with 10 ml of overnight 
culture and agitated at 37°C. IPTG was added at a final concentration of 1 mM to 
initiate protein expression when OD (600 nm) reaches one. The cells were harvested 
by centrifugation 3 hours after the addition of IPTG and kept frozen at 70°C until 
used. 

Proteins were purified as follows. Cells were suspended in 5 ml/(g cell) of 
Tris (50 mM, pH 7.6) containing phenylmethylsulfonyl fluoride (1 mM). Hen egg 
lysozyme (Sigma) was added to a final concentration of 0.5 mg/ml. After incubating 
the solution for 30 minutes at 37° C, it was sonicated three times for 30 seconds on ice. 
Cell debris was removed by centrifugation. Concentrated sodium chloride was added 
to the solution to a final concentration of 0.5 M. The solution was applied to a 
HI-TRAP chelating column (Pharmacia) preloaded with nickel and equilibrated in the 
Tris buffer containing sodium chloride (0.5 M). After washing the column with the 
buffer, histag-Fn3 was eluted with the buffer containing 500 mM imidazole. The 
protein was further purified using a RESOURCES® column (Pharmacia) with a NaCl 
gradient in a sodium acetate buffer (20 mM, pH 4.6). 

With the GKKGK (SEQ ID NO:109) tail, the solubility of the Ubi4 protein 
was increased to over 1 mM at low pH and up to -50 \iM at neutral pH. Therefore, 
further analyses were performed on Ubi4 with this C-terminal extension (hereafter 
referred to as Ubi4-K). It has been reported that the solubility of a minibody could be 
significantly improved by addition of three Lys residues at the N- or C-termini 
(Bianchi et al. 9 1994). In the case of protein Rop, a non-structured C-terminal tail is 
critical in maintaining its solubility (Smith et aL, 1995). 

Oligomerization states of the Ubi4 protein were determined using a size 
exclusion column. The wild-type Fn3 protein was monomeric at low and neutral 
pH's. However, the peak of the Ubi4-K protein was significantly broader than that of 
wild-type Fn3, and eluted after the wild- type protein. This suggests interactions 
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between Ubi4-K and the column material, precluding the use of size exclusion 
chromatography to determine the oligomerization state of Ubi4. NMR studies suggest 
that the protein is monomelic at low pH. 

The Ubi4-K protein retained a binding affinity to ubiquitin as judged by 
5 ELISA (Figure 15d). However, an attempt to determine the dissociation constant 
using a biosensor (Affinity Sensors, Cambridge, U.K.) failed because of high 
background binding of Ubi4-K-Fn3 to the sensor matrix. This matrix mainly consists 
of dextran, consistent with our observation that interactions between Ubi4-K interacts 
with the cross-linked dextran of the size exclusion column. 

10 

Example XVII 
Stability Measurements of Monobodies 

Guanidine hydrochloride (GuHCl)-induced unfolding and refolding reactions 
were followed by measuring tryptophan fluorescence. Experiments were performed 

15 on a Spectronic AB-2 spectrofluorometer equipped with a motor-driven syringe 

(Hamilton Co.). The cuvette temperature was kept at 30°C. The spectrofluorometer 
and the syringe were controlled by a single computer using a home-built interface. 
This system automatically records a series of spectra following GuHCl titration. An 
experiment started with a 1 .5 ml buffer solution containing 5 |iM protein. An 

20 emission spectrum (300-400 nm; excitation at 290 nm) was recorded following a 
delay (3-5 minutes) after each injection (50 or 100 |il) of a buffer solution containing 
GuHCl. These steps were repeated until the solution volume reached the full capacity 
of a cuvette (3.0 ml). Fluorescence intensities were normalized as ratios to the 
intensity at an isofluorescent point which was determined in separate experiments. 

25 Unfolding curves were fitted with a two-state model using a nonlinear least-squares 
routine (Santoro & Bolen, 1988). No significant differences were observed between 
experiments with delay times (between an injection and the start of spectrum 
acquisition) of 2 minutes and 10 minutes, indicating that the unfolding/refolding 
reactions reached close to an equilibrium at each concentration point within the delay 

30 times used. 
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Conformational stability of Ubi4-K was measured using above-described 
GuHCl-induced unfolding method. The measurements were performed under two sets 
of conditions; first at pH 3.3 in the presence of 300 mM sodium chloride, where 
Ubi4-K is highly soluble, and second in TBS, which was used for library screening. 
5 Under both conditions, the unfolding reaction was reversible, and we detected no 
signs of aggregation or irreversible unfolding. Figure 16 shows unfolding transitions 
of Ubi4-K and wild-type Fn3 with the N-terminal (his) 6 tag and the C-terminal 
solubility tag. The stability of wild-type Fn3 was not significantly affected by the 
addition of these tags. Parameters characterizing the unfolding transitions are listed in 
10 Table 7. 



Table 7. Stability parameters for Ubi4 and wild-type Fn3 as determined by GuHCl- 
induced unfolding 



15 Protein AG 0 (kcal mol" 1 ) m G (kcal mol 1 M" 1 ) 



20 



Ubi4 (pH 7.5) 


4.8 ±0.1 


2.12 ±0.04 


Ubi4 (pH 3.3) 


6.5 ±0.1 


2.07 ± 0.02 


Wild-type (pH 7.5) 


7.2 ±0.2 


1.60 ±0.04 


Wild-type (pH 3.3) 


11.2 ±0.1 


2.03 ± 0.02 



AG 0 is the free energy of unfolding in the absence of denaturant; m G is the dependence 
of the free energy of unfolding on GuHCl concentration. For solution conditions, see 
Figure 4 caption. 



25 Though the introduced mutations in the two loops certainly decreased the stability of 
Ubi4-K relative to wild-type Fn3, the stability of Ubi4 remains comparable to that of a 
"typical" globular protein. It should also be noted that the stabilities of the wild-type 
and Ubi4-K proteins were higher at pH 3.3 than at pH 7.5. 
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The Ubi4 protein had a significantly reduced solubility as compared to that of 
wild-type Fn3, but the solubility was improved by the addition of a solubility tail. 
Since the two mutated loops comprise the only differences between the wild-type and 
Ubi4 proteins, these loops must be the origin of the reduced solubility. At this point, 
5 it is not clear whether the aggregation of Ubi4-K is caused by interactions between the 
loops, or by interactions between the loops and the invariable regions of the Fn3 
scaffold. 

The Ubi4-K protein retained the global fold of Fn3, showing that this scaffold 
can accommodate a large number of mutations in the two loops tested. Though the 

10 stability of the Ubi4-K protein is significantly lower than that of the wild-type Fn3 
protein, the Ubi4 protein still has a conformational stability comparable to those for 
small globular proteins. The use of a highly stable domain as a scaffold is clearly 
advantageous for introducing mutations without affecting the global fold of the 
scaffold In addition, the GuHCl-induced unfolding of the Ubi4 protein is almost 

15 completely reversible. This allows the preparation of a correctly folded protein even 
when a Fn3 mutant is expressed in a misfolded form, as in inclusion bodies. The 
modest stability of Ubi4 in the conditions used for library screening indicates that Fn3 
variants are folded on the phage surface. This suggests that a Fn3 clone is selected by 
its binding affinity in the folded form, not in a denatured form. Dickinson et al 

20 proposed that Val 29 and Arg 30 in the BC loop stabilize Fn3. Val 29 makes contact 
with the hydrophobic core, and Arg 30 forms hydrogen bonds with Gly 52 and Val 75. 
In Ubi4-Fn3, Val 29 is replaced with Arg, while Arg 30 is conserved. The FG loop 
was also mutated in the library. This loop is flexible in the wild-type structure, and 
shows a large variation in length among human Fn3 domains (Main et aL 9 1992). 

25 These observations suggest that mutations in the FG loop may have less impact on 
stability. In addition, the N- terminal tail of Fn3 is adjacent to the molecular surface 
formed by the BC and FG loops (Figure 1 and 17) and does not form a well-defined 
structure. Mutations in the N-terminal tail would not be expected to have strong 
detrimental effects on stability. Thus, residues in the N-terminal tail may be good 

30 sites for introducing additional mutations. 
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Example XVIII 
NMR Spectroscopy of Ubi4-Fn3 



10 



15 



20 



25 



Ubi4-Fn3 was dissolved in [ 2 H]-Gly HC1 buffer (20 mM, pH 3.3) containing 
NaCl (300 mM) using an Amicon ultrafiltration unit. The final protein concentration 
was 1 mM. NMR experiments were performed on a Varian Unity INOVA 600 
spectrometer equipped with a triple-resonance probe with pulsed field gradient The 
probe temperature was set at 30°C. HSQC, TOCSY-HSQC and NOESY-HSQC 
spectra were recorded using published procedures (Kay et al 9 1992; Zhang et al. 9 
1994). NMR spectra were processed and analyzed using the NMRPIPE and 
NMRVIEW software (Johnson & Blevins, 1994; Delaglio et aL, 1995) on UNIX 
workstations. Sequence-specific resonance assignments were made using standard 
procedures (Wiithrich, 1986; Clore & Gronenborn, 1991). The assignments for 
wild- type Fn3 (Baron et aL, 1992) were confirmed using a 15 N-labeled protein 
dissolved in sodium acetate buffer (50 mM, pH 4.6) at 30°C. 

The three-dimensional structure of Ubi4-K was characterized using this 
heteronuclear NMR spectroscopy method. A high quality spectrum could be collected 
on a 1 mM solution of 15 N-labeled Ubi4 (Figure 17a) at low pH. The linewidth of 
amide peaks of Ubi4-K was similar to that of wild-type Fn3, suggesting that Ubi4-K 
is monomeric under the conditions used. Complete assignments for backbone *H and 
15 N nuclei were achieved using standard *H, 15 N double resonance techniques, except 
for a row of His residues in the N-terminal (His) 6 tag. There were a few weak peaks 
in the HSQC spectrum which appeared to originate from a minor species containing 
the N-terminal Met residue. Mass spectroscopy analysis showed that a majority of 
Ubi4-K does not contain the N-terminal Met residue. Fig. 17 shows differences in 
'HN and 15 N chemical shifts between Ubi4-K and wild-type Fn3. Only small 
differences are observed in the chemical shifts, except for those in and near the 
mutated BC and FG loops. These results clearly indicate that Ubi4-K retains the 
global fold of Fn3, despite the extensive mutations in the two loops. A few residues 
in the N-terminal region, which is close to the two mutated loops, also exhibit 
significant chemical differences between the two proteins. An HSQC spectrum was 
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also recorded on a 50 \iM sample of Ubi4-K in TBS. The spectrum was similar to that 
collected at low pH, indicating that the global conformation of Ubi4 is maintained 
between pH 7.5 and 3.3. 



cited herein are incorporated by reference as if individually incorporated. The 
foregoing detailed description and examples have been given for clarity of 
understanding only. No unnecessary limitations are to be understood therefrom. The 
invention is not limited to the exact details shown and described for variations 
10 obvious to one skilled in the art will be included within the invention defined by the 
claims. 



5 



The complete disclosure of all patents, patent documents and publications 
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